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Preface to the Second Edition 


DURING THE SIX YEARS SINCE THE PUBLICATION OF THE FIRST 
edition, there has been a substantial increase in the attention paid by 
research workers to the principles of experimental design. Evidence of 
this change can be seen in the medical and social sciences, in physics 
and chemistry, and in industrial research. Another encouraging trend 
is that workers in these areas, although still willing to utilize appropriate 
designs taken from agricultural experimentation, have begun to examine 
their own problems of experimentation in a fresh light and to produce 
new designs better suited to their particular conditions. The recent 
literature also reflects a move toward greater depth and comprehensive- 
ness in experimental work, as instanced by numerous papers devoted to 
experimentation with more than one factor. 

In preparing a second edition we decided to leave the framework of 
the book unchanged. New material is included wherever necessary to 
fulfill our original objective of making available the types of design most 
likely to be useful in practice. 

Two chapters have been added. Chapter 6A deals with the fractional 
replication of factorial experiments. This technique, initially applied in 
agricultural field experiments, was described briefly in the first edition. 
In the intervening period, fractional replication has proved so produc- 
tive in exploratory research, particularly in industry, that a more com- 
plete account is justified. 

Chapter 8A is also concerned with experiments of the factorial type. 
It presents methods and new designs for experiments in which the fac- 
tors represent quantitative variables, measured on a continuous scale. 
In this situation it is often natural to think of the yield or response as 
related to the levels of the factors by some mathematical function which, 
if it could be found, would summarize compactly all the results of the 
experiment. The new methods are applicable when this function can 
be approximated, within the limits of the experimental region, by a 
polynomial of the first or second degree. The second half of this chapter 
gives an introductory account of experimental strategies for finding, as 
quickly as possible, the levels at which the factors must be set in order 
to obtain the maximum response. These techniques, produced for in- 
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dustrial research and development, have many potential applications in 
other areas. s 

The appearance of several series of new incomplete block designs 
during the last six years necessitated expansions in some of the later 
chapters. The investigator who has found these designs useful now has 
2 considerably wider selection. In fact, the large number of new designs 
forced us to revise one of our original objectives. When it appeared, the 
first edition contained the plans of substantially all the designs in com- 
mon use at that time. . To include the new plans, however, would involve 
devoting over 200 extra pages to plans alone. Even if this were done, 
our set of plans would soon become incomplete as further research is 
published. We have tried to handle this problem by giving, at the end 
of Chapter 9, an index to the incomplete block designs now available. 
For designs for which no plan is presented, the purpose of the design 
and the method of analysis are explained at the appropriate place in a 
later chapter, with references to the sources of the plans. The principal 
series of designs represented in this way are some new confounded fac- 
torial designs in single replication, partially balanced incomplete block 
designs (with elimination of either а single or à double grouping), the 
chain block and generalized chain block designs and some further de- 
signs related to the latin square. 

To cover other developments, a number of sections have been added. 
Section 2.22a presents a table for estimating the numbers of replications 
needed for comparing two treatments when the data are arranged in two 
classes, as dead or alive, sound or defective. Sections 4.15a and 4.27a 
give the methods of analysis with data of this type for completely ran- 
domized and randomized block designs. These sections fill a gap in the 
first edition, which gave methods of analysis only for continuous data. 

Sections 4.61a to 4.662 contain a discussion of the use of latin squares 
in adjusting for residual effects which may be present when the treat- 
ments are applied in sequence to the same subject. 

Topics that are presented more briefly are sequential experimentation 
(section 2.23a), the testing of effects suggested by the data (section 3.53), 
the problem of making several tests of significance in the same experi- 
ment (section 3.54a), Yates’ automatic method of computing factorial 
effect totals (section 5.24a), additional standard error formulae for split- 
plot experiments (section 7.22), the effects of errors in the weights on 
the recovery of inter-block information (section 10.12a), and the use of 
balanced incomplete block designs in taste and preference testing (sec- 
tion 11.1a). Some of these topics deserve more space than we have 
given them, but the number of pages added in this edition is already 
greater than we should have liked. 
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A number of users of the book, particularly those who use it for teach- 
ing, asked us to prepare this edition so that the new material is clearly 
identifiable. To accomplish this, we have numbered the new chapters 
as бА and 8A. Any new section, table, or plan carries the letter a, e.g., 
section 2.23a, table 11.1a. Where a minor revision or addition has been 
made within a section, the old section number remains unchanged. 

For permission to use data as examples we are indebted to the General 
Foods Corporation and to Doctors J. Doull and J. N. Tolentino. Con- 
siderable help has been received from staff members at our respective 
institutions and from colleagues elsewhere. 

The large sale of the first edition of a book that is not elementary has 
been highly gratifying. It testifies to the alertness of workers in the 
experimental sciences in seeking out any new ideas and methods that 
may make their investigations more efficient and productive. 


У. С. COCHRAN 
G. M. Cox 
May, 1957 


Preface to the First Edition 


WORK ON THIS BOOK WAS STARTED WHEN BOTH OF US WERE MEM- 
bers of the staff of Iowa State College. At that time requests were re- 
ceived rather frequently from research workers. Some wanted advice 
on the conduct of a specific experiment: others, who had decided to use 
one of the more complex designs that have been discovered in recent 
years, asked for a plan or layout that could be followed during the ex- 
perimental operations. Although the logical principles governing the 
subject of experimentation are admirably expounded in Fisher’s book 
The Design of Experiments, these requests indicated a need for a different 
type of book, one which would describe in some detail the most useful 
of the designs that have been developed, with accompanying plans and 
an account of the experimental situations for which each design is most 
suitable. Such a book is directed at the experimenter and is intended 
to serve as a handbook which is consulted when a new experiment is 
under consideration. 

Mainly on account of the war, slow progress was made. In 1944 we 
completed a mimeographed draft of which several hundred copies were 
distributed. Many helpful suggestions were made by readers. Of these, 
the most frequent was to the effect that we should include more material 
dealing with the statistical analysis of the results. In the mimeographed 
draft, our practice was to give references to worked examples of the 
analysis in cases where they could be found in the literature, and to 
present examples for only those designs whose analysis was not available. 
To this it was objected that many research workers did not have easy 
access to our references. 

This suggestion raised a difficult issue. To present a self-contained 
account of the analysis of variance in all its ramifications would make 
the book, it seemed to us, unwarrantably long and expensive. Conse- 
quently we have continued to assume that the reader has some knowl- 
edge of the principles of analysis of variance and of the computational 
methods involved. We have included a brief review of the basic. theory 
and an extensive set of worked examples of the analysis for both com- 
mon and less common types of design. Although strenuous efforts were 
made to obtain a selection of examples from diverse fields of research, a 
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preponderance from biology, and more particularly from agriculture, was 
dictated by our own experience in those areas. On several occasions we 
rejected an example which would have made an attractive addition to 
the scope because we did not feel sufficiently familiar with the conditions 
of experimentation to give a realistic account of the problems encoun- 
tered. 

Since courses of lectures on the design of experiments are being intro- 
duced in many colleges and universities, some teachers may be inter- 
ested in the potentialities of the book as a textbook. Several comments 
are prompted by our own experience in this connection. First, it will 
often be necessary for the teacher to provide a more systematic develop- 
ment of the analysis of variance than is given here. Second, interest in 
such а course is greatly enhanced by examples from an environment 
with which the listeners are familiar, and especially by examples of 
experiments that have been conducted by some of the listeners. Thus 
the teacher is well advised to build the course around his own examples, 
using those in this book mainly as supplementary material for the stu- 
dents. Finally, a selective use of the book is in order, because it contains 
much more material than can be covered in a typical one-quarter course, 
and because some difficult topics have been dealt with in the early parts 
of the book. 

We wish to express our gratitude to the many staff members at Iowa 
State College and North Carolina State College who helped us by putting 
experimental data at our disposal, by providing painstaking descriptions 
of their experimental techniques, or by lending their assistance in the 
preparation of the manuscript and plans. For similar kindnesses in con- 
nection with certain of the examples we are indebted to G. F. Potter, 
D. Y. Solandt, D. В. DeLury and J. Hunter, F. M. Wadley and C. F. 
Rainwater, and the Wailuku Sugar Company, Honolulu. Some theo- 
retical results used in Chapter 14 were developed from research con- 
ducted at Raleigh under a contract with the Office of Naval Research. 
Finally, our thanks go to George W. Snedecor, who participated in the 
original plans for this book and made a careful reading of the first draft. 


У. С. COCHRAN 


С. М. Сох 
January, 1950 
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CHAP TLE Ва 1 


Introduction 


11 The Contribution of Statistics to Experimentation 


1.11 Тһе Problem of Interpretation. Since statisticians do not usually 
perform experiments, their claim to attention when they write on this 
subject requires some explanation. It is true that on many important 
aspects of experimentation the statistician has no expert knowledge. 
Nevertheless, in recent years, research workers have turned increasingly 
to statisticians for help both in planning their experiments and in draw- 
ing conclusions from the results. That this has happened is convincing 
evidence that statistics has something to contribute. 

At first, requests for assistance were nearly always concerned with 
the interpretation of the results. It is a common characteristic of ex- 
periments in widely diverse fields of research that, when they are re- 
peated, the effects of the experimental treatments vary from trial to 
trial. This variation introduces a degree of uncertainty into any con- 
clusions that are drawn from the results. Even after a number of repeti- 
tions, or replications as they are called, the investigator still does not 
know by how much his results would be changed if the experiment were 
repeated further under the same conditions. Successive trials may be so 
discrepant in their results that it is doubtful which of two treatments 
would turn out better in the long run. 

As an illustration of this variation, data are given i in table 1.1 from a 
simple experiment. The data are the times (minus 2 minutes) required 
to perform a routine statistical calculation, that of finding the sum of 
squares of 27 observations. Each sum of squares was computed sep- 
arately by the same person on both of two standard computing machines. 
In all, 10 different sums of squares were worked, making 10 trials or rep- 
lications of the experiment. It will be noted that the differences in 
speed range from 17 seconds in favor of machine В to 2 seconds in favor 
of machine A. Some experimenters may comment that the results of 
this experiment are remarkably well behaved, and exhibit nothing like 
the variation with which they have to contend. The results will, how- 


ever, serve our purpose. 
1 


2 INTRODUCTION 1.11 


TABLE 1.1 TIME IN SECONDS (MINUS 2 MINUTES) REQUIRED FOR COMPUTING 
SUMS OF SQUARES 


Machine 

m Difference 
Replication A B (A — В) 

1 30 14 16 

2 21 21 0 

3 22 5 17 

4 22 13 9 

5 18 13 5 

6 29 17 12 

7 16 L 9 

8 12 14 -2 

9 23 8 15 

10 23 24 —1 

Means 21.6 13.6 8.0 


The object of the experiment is, of course, to compare the speeds of 
the two machines for this calculation. Моге specifically, two objects 
might be stated. The first is to answer the question: is there any dif- 
ference in speed? or, to put it another way, to test the hypothesis that 
there is no difference in speed. The second object, which is related to the 
first, is to estimate the size of the difference in speed. Almost all ex- 
periments are carried out for one or both of these purposes—the testing 
of hypotheses and the estimation of differences in the effects of different 
treatments, 

As regards the test of the hypothesis that there is no difference in 
speed, we might report, as relevant evidence, that B proved faster 7 
times out of 10, A twice, while once there was а tie. For the problem of 
estimation we might report that the average difference in speed in the 
experiment was 8 seconds in favor of B. These purely descriptive state- 
ments do not carry us very far. Their weakness is that they supply no 
information about the reliability of the figures presented. For example, 
have we any confidence that, if the-experiment were continued for an- 
other set of 10 trials, the advantage at the end would still be close to 8 
seconds in favor of B? " 

Because of the deficiencies in the descriptive approach, experimenters 
have adopted a different point of view in the summarization of their re- 
sults. They tend to reason as follows. Suppose that it were feasible to 
continue the experiment indefinitely under the same conditions. The 
average difference in speed between the two machines would presumably 
settle down to some fixed value. This value, which will be independent 
of the size of the exneriment that was actually carried out, may reason- 
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ably be called the true difference between A and B. From this point of 
view, the problem of summarizing the results may be restated in the 
question: what can we say about the true difference between A and B? 
This is a problem in induction from the part to the whole, or in sta- 
tistical language, from the sample to the population. A solution to this 
problem has been developed by means of the theory of statistics. It is 
this solution that constitutes the principal contribution of statistics to 
the interpretation of the results. 


1.12 Statistical Inference. Obviously, it cannot be expected that the 
solution will provide the exact value of the unknown true difference. As 
a less ambitious goal we might hope to be able to find 2 limits within 
which the exact value is certain to lie, but even this cannot quite be at- 
tained. What can be done is that for any chosen probability, say .95, 
two limits are found such that the probability that they enclose the true 
difference is .95. In other words, limits can be found that are almost 
certain to enclose the true difference, where the degree of certainty, as 
measured by the probability, can be chosen by the experimenter. Since 
we wish to focus attention, on the type of inferential statement that can 
be made rather than on the method of calculating the limits, the com- 
putations will not be discussed at present. For the example in table 1.1 
they will be found in section 4.42. When the probability is .95, the limits 
for the true difference in speed between the 2 machines turn out to be 
3.3 and 12.7 seconds in favor of machine В. А statement that В is 
faster by an amount that lies between 3.3 and 12.7 seconds has a 1 in 
20 chance of being wrong. If the degree of certainty is decreased by 
lowering the probability to .8, the limits are narrowed to 5.1 and 10.9 
seconds. If the probability is raised to .99, the limits become 1.1 and 
14.9 seconds, and as the probability is brought closer to certainty, the 
limits steadily become farther apart. The limits are cailed confidence 
limits, and the probabilities are called confidence probabilities.* 

These probabilities are not merely academic abstractions: they can 
be subjected to at least a rough experimental verification. For verifi- 
cation, we need a situation where the true difference between the effects 
of two treatments is known. In toxicology, for instance, this situation 
can sometimes be obtained by diluting a standard poison to a known 
extent. The dilution is sent to the laboratory, labelled as an unknown" 
poison. By experiments on animals, confidence limits are found for the 


* Fisher (1.2) has developed statistical inference in terms of fiducial limits. The 
two concepts, fiducial and confidence limits, have different logical backgrounds, al- 
though in all simple applications to controlled experiments the actual values of the 
fiducial and confidence limits are identical. For a discussion, see Kendall (1.1). 
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amount of the unknown that has the same toxicity аз a given amount 
of the standard. Since the persons originating the experiment know the 
true value for this amount, they can verify whether the statement that 
the true amount lies within the limits is correct. The practical difficulty 
with this type of check is the labor required. A large number of experi- 
ments would be needed to verify whether about 95% of the statements 
made with confidence probability .95 were in fact correct, and about 
5% were wrong. 

As we have seen, the statistical solution to the problem of estima- 
tion consists of a statement that the true difference lies between certain 
limits, plus a probability that the statement is correct, It is of interest 
to consider whether this type of information is sufficiently precise to 
permit decisions of practical importance to be made. Although a 
thorough discussion of this question would be rather lengthy, inferences 
of this type often permit definite action to be taken with confidence 
that the action will be fruitful. When they fail to decide the point at 
issue, the reason is nearly always that the data obtained are insufficient. 
For illustration, suppose that it is desired to discover whether the ap- 
plication of a dressing of some fertilizer to a crop will be profitable. The 
cost of the fertilizer is such that its application will be profitable only if 
it increases yields on the average by 2 or more bushels. A series of ex- 
periments is carried out in order to estimate the true average response 
to the fertilizer. If the 95% confidence limits for the increase due to the 
fertilizer are 4 and 11 bushels, its use can be adopted with a good deal 
of assurance that it will be profitable. Similarly, if the 95% confidence 
limits are —5 and 1 bushels, a decision not to use the fertilizer follows. 
A case where there is uncertainty occurs when the limits are 0 and 5 


ment with the hypothesis, and the problem is to decide whether the 
discrepancy between the data and the hypothesis is to be ascribed to 
these variations or to the fact that the hypothesis is not true. The con- 
tribution of statistics is the operation known as a test of significance 
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Essentially, this is a rule for deciding, from examination of the data, 
whether or not to reject the hypothesis. Such rules are made to satisfy 
two conditions that are obviously desirable: (i) hypotheses that are true 
shall be rejected only very occasionally, and the probability of rejection 
may be chosen by the experimenter; (ii) hypotheses that are false shall 
be rejected as often as possible. 

This technique enables the experimenter to test his hypotheses about 
the action of the treatments, with the assurance that there is little risk 
of erroneously rejecting a hypothesis that happens to be correct. Prob- 
abilities of .05 and .01 are most commonly used for this risk, and in 
these cases the tests are said to be made at the 5 and 1% significance 
levels respectively. These levels are just useful conventions, and a lower 
probability may be used if the consequences of an erroneous rejection 
of the hypothesis are very serious. It should be remembered, however, 
that in lowering this probability value we automatically diminish the 
chance of rejecting a hypothesis that is false. 

A useful property of a test of significance is that it exerts a sobering 
influence on the type of experimenter who jumps to conclusions on 
scanty data, and who might otherwise try to make everyone excited 
about some sensational treatment effect that can well be ascribed to the 
ordinary variation in his experiment. On the whole, however, tests of 
significance are less frequently useful in experimental work than con- 
fidence limits. In many experiments it seems obvious that the dif- 
ferent treatments must have produced some difference, however small, 
in effect. Thus the hypothesis that there is no difference is unrealistic: 
the real problem is to obtain estimates of the sizes of the differences. 

The construction of confidence limits may add something to the in- 
terpretation of a test of significance. For instance, suppose that the 
speeds of the 2 machines in the example had not been significantly dif- 
ferent. It is a commonplace that this result would not prove that the 
2 machines were identical in speed. However, if the 95% confidence 
limits for the difference in speed were —2 and +4 seconds, we might 
argue that a true difference of 4 seconds, even if it existed, would be of 
no practical significance. Consequently, it could be said that for all 
practical purposes the 2 machines are identical in speed. This is much 
more positive and useful than the mere statement that the difference in 
speeds was not statistically significant. If, on the other hand, the con- 
fidence limits were —30 and +82 seconds, there is no justification for the 
conclusion that the machines can be regarded as equivalent. All that 
we have learned is that the data are not sufficiently accurate to show 
whether there is a difference in speed that is of practical importance. 

To summarize: variability in results is typical in many branches of 
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experimentation. Because of this, the problem of drawing conclusions 
from the results is a problem in induction from the sample to the popula- 
tion. The statistical theories of estimation and of testing hypotheses 
provide solutions to this problem in the form of definite statements that 
have a known and controllable probability of being correct. These 
statements are specific enough to be useful in deciding whether action 
can be taken on the basis of the results. 


1.13 The Function of Randomization. As would be expected, the type 
of statistical inference that can be made from a body of data depends on 
the nature of the data. It is easy to conduct an experiment in such a 
way that no useful inferences can be made, and many of the experiments 
brought to the statistician, particularly in earlier years, have been of this 
type. To take a simple example, suppose that in the comparison of the 
calculating machines each sum of squares had been computed first on 
machine A and then on machine В. Now it is quite possible that in- 
creased familiarity with the data will enable the second computation to 
be done faster than the first. The advantage is unlikely to be great in 
such an easy calculation; it could be so if the computation were more 
difficult. (In the experiment this advantage was estimated at about 4 
seconds, though the confidence limits for the true advantage are rather 
far apart.) 

1f the experiment is conducted in this way, the observed difference in 
Speed (B — A) is an estimate of the true difference, plus the unknown 
difference in speed between a second calculation and a first. Confidence 
limits set up by statistical techniques apply not to the true difference 
between B and A but to the true difference plus this unknown advantage. 
Consequently, the limits tell us nothing definite about the true difference 
between B and A. The interpretation of tests of significance also be- 


biases that are quite unexpected can occur from some apparently in- 
nocuous rule about the way in which different treatments are handled. 

Tn order to avoid these Ыазез we need some means of insuring that 8 
treatment will not be continually favored or handicapped in successive 
replications by some extraneous source of variation, known or unknown. 
This is done by the device known as randomization, due to Fisher. In- 
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stead of performing every calculation first on machine A, we apply the 
principle of randomization by tossing a coin to determine whether A or 
B shall be used first in any trial. The decision is made independently 
in each trial. The effect is that in any trial each machine has an equal 
chance of being tested under the more favorable conditions. Of course, 
the result of any specific randomization may favor one or the other 
treatment. But this happens only to an extent that is allowed for in the 
calculations that are used for tests of significance and confidence limits. 
This important result has been illustrated in detail by Fisher (1.2), who 
has shown how tests of significance and confidence limits can be con- 
structed, using only the fact that randomization has been properly ap- 
plied in the experiment. Randomization is one of the few character- 
istics of modern experimental design that appears to be really modern. 
One can find experiments made 100 or 150 years ago that embody the 
principles that are now regarded as sound, with the conspicuous ex- 
ception of randomization. 

The occasions on which randomization is required vary with the type 
of experiment and must be left to the judgment of the experimenter. 
One occasion arises when the treatments are allotted to the experimental 
material. Suppose that the effects of different diets on the heights and 
weights of children are to be ascertained. Since different children grow 
at different rates, a treatment that happens to be assigned to a group of 
fast-growing youngsters is favored. Consequently, we allot the diets in 
any replication at random to the children who are to receive them, with 
a new random allotment in each replication. Similarly, if 4 different 
oven temperatures for the cooking of roasts are under comparison, the 
4 temperatures are assigned at random to the 4-roasts which form the 
material for any replication. Sometimes this is the only randomization 
required, but frequently other operations that are carried out in the 
course of the experiment are also potential sources of bias. With a 
repetitious operation the order of events may be important, either be- 
cause a learning process is involved which tends to make later operations 
better than the earlier ones, or because fatigue tends in the opposite 
direction. Systematic biases may be guarded against by randomizing 
the order in which the operation is performed on the different treatments 
in a replication. In other cases the equipment that is used introduces 
variation. For example, if 4 ovens are available to cook the 4 roasts, 
we would not always use the same oven for the same temperature, in 
case biases should be introduced because of systematic differences among 
the ovens. Instead, the temperatures could be assigned at random to 
the ovens in each replication. Thus we have two randomizations, one 
to assign the temperatures to the roasts and one to assign them to the 
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ovens. If, however, we decide, before randomizing, which roast is to go 
in which oven, the two randomizations can be reduced to one. This 
method can always be used, if convenient, to cut down the number of 
randomizations that must be made. 

Randomization is somewhat analogous to insurance, in that it is a 
precaution against disturbances that may or may not occur and that 
may or may not be serious if they do occur. It is generally advisable to 
take the trouble to randomize even when it is not expected that there 
will be any serious bias from failure to randomize. The experimenter is 
thus protected against unusual events that upset his expectations. Of 
course in experiments where a great number of physical operations are 
involved, the application of randomization to every operation becomes 
time consuming, and the experimenter may use his judgment in omitting 
randomization where there is real knowledge that the results will not be 
vitiated. It should be realized, however, that failure to randomize at 
any stage may introduce bias unless either the variation introduced in 
that stage is negligible or the experiment effectively randomizes itself. 
1.14 Restricted Randomization. We now reconsider the randomiza- 
tion proposed for the experiment on computing machines, in order to 


discuss a criticism that presumably has occurred to the reader. When ' 


this experiment is being planned, we possess the knowledge that there is 
some advantage to the machine on which the second calculation of any 
sum of squares is made. In the light of this knowledge, it seems wise to 
make sure that each machine receives the advantage in five of the 10 
replications, rather than to leave this to be decided by the tossing of 
coins. In fact, if the decision is made by tossing coins, the chances are 
only about 1 in 4 that each machine will be used first in five replications; 
they are also about 1 in 4 that one machine will be used first in 7 replica- 
tions. The criticism is that the randomization is unlikely to give as ac- 
curate a comparison as the proposed alternative. The criticism is sound, 
and the experiment was actually planned so that each machine would 
be used first in half the replications. 

Randomization was also applied in the design actually used, though it 
is a different type from that originally proposed, being subiect to the 
restriction that each machine must appear first exactly five times. The 
process of randomizing therefore consisted in choosing five numbers at 
random from the numbers 1 to 10; these turned out to be 1, 3, 6, 8, and 9. 
Accordingly, machine A was used first in replications 1, 3, 6, 8, and 9. 
This is one of the cases where it is difficult to see why failure to random- 
ize could have led to any serious danger of bias. It probably would have 
been quite satisfactory to use machine A first in the first 5 replicates, and 
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machine B first in the remainder. Nevertheless, it might happen that 
the #уапќаре in the second calculation of a sum of squares would 
diminish in later replications; and, to save the trouble of trying to guess 
whether biases of this type are likely, the randomization was used as a 
precaution. 

Тһе example represents a fairly common situation, in which knowledge 
of the variation that will affect the results is sufficiently detailed so that 
the simplest type of randomization is objectionable on the grounds of 
accuracy. Most of the designs in this book were constructed for this 
situation. By suitable restrictions on the randomization, these designs 
enable the experimenter to utilize any knowledge that will increase the 
precision of the experiment. In addition, each design allows sufficient 
randomization to prevent biases from sources of variation about which 
knowledge is less certain. For each of the principal designs described, 
the appropriate method of randomization is also presented. 

One further point should be mentioned. We have seen that the ex- 
periment on computing speeds could have been carried out either by 
means of the simple randomization originally proposed or by means of 
the restricted randomization which ensures that each machine is used 
first five times. Both methods provide a valid test of significance and 
valid confidence limits. However, the calculations made for the test of 
significance and for the confidence limits are different in the 2 cases. 
The first experiment is of the type known as randomized blocks, and 
calculations are made as described in section 4.23; the second experi- 
ment is of the “cross-over” type, for which the calculations are given in 
section 4.42. This is an instance of the general rule that the way in 
which the experiment is conducted determines not only whether in- 
ferences can be made, but also the calculations required to make them. 
The experimenter must make sure that the calculations which he uses 
are appropriate for the experiment. 


1.2 Initial Steps in the Planning of Experiments 


1.21 Importance of the Initial Steps. It has been mentioned that 
statisticians are often asked for advice in making inferences from the 
results of experiments. Since the inferences that can be made depend on 
the way in which the experiment was carried out, the statistician should 
request a detailed description of the experiment and its objectives. It may 
then become evident that no inferences can be made or that those which 
can be made do not answer the questions to which the experimenter had 
hoped to find answers. In these unhappy circumstances, about all that 
can be done is to indicate, if possible, how to avoid this outcome in future 
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experiments. Consequently, it has come to be realized that the time to 
think about statistical inference, or to seek advice, is when the experi- 
ment is being planned. 

Participation in the initial stages of experiments in different areas of 
research leads to a strong conviction that too little time and effort is put 
into the planning of experiments. The statistician who expects that his 
contribution to the planning will involve some technical matter in sta- 
tistical theory finds repeatedly that he makes a much more valuable 
contribution simply by getting the investigator to explain clearly why 
he is doing the experiment, to justify the experimental treatments whose 
effects he proposes to compare, and to defend his claim that the com- 
pleted experiment will enable its objectives to be realized. For this 
reason the remainder of this chapter is devoted to some elementary 
comments on the subject of planning. These comments are offered with 
diffidence, because they concern questions on which the statistician has, 
or should have, no special authority, and because some of the advice is 
so trite that it would be unnecessary if it were not so often overlooked. 

It is a good practice to make a written draft of the proposals for any 
experiment. This draft will in general have three parts: (i) a statement 
of the objectives; (ii) a description of the experiment, covering such 
matters as the experimental treatments, the size of the experiment, and 
the experimental material; and (iii) an outline of the method of analysis 
of the results. 


1.22 The Statement of Objectives. This statement may be in the 
form of the questions to be answered, the hypotheses to be tested, or the 
effects to be estimated. The aim should be to make the statement lucid 
and specific. The most common faults are vagueness and excessive 
ambition, in the sense that a twenty-year research program would be 
required in order to realize the stated objectives. Often it is advisable 
to classify the objectives as major and minor, because some types of 
experiment give high precision for certain treatment comparisons but 
low precision for others. When the experiment represents a cooperation 
among people of different interests, this classification is particularly 
helpful in that it makes clear which objectives take priority and helps 
to avoid an unhappy compromise that is adopted in the hope of pleasing 
everybody. 

The statement should include an account of the area over which gen- 
eralizations are to be made, or in other words, of the population about 
which it is hoped to make inferences. If an experiment is to be con- 
ducted on persons suffering from some disease, are the results presumed 
to apply to the patients in a specific hospital, to patients in all hospitals, 
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or to all sufferers whether in hospital or not? This sort of question is 
crucial in applied research, where often the experimenter has some 
definite population in mind to which he would like to apply the results. 
Tt is obvious that worth-while inferences about an extensive population 
cannot be made from a single experiment. For instance, inferences made 
from the experiment on the two computing machines are restricted to 
the person who made the calculations and to the type of calculation 
made. There is no guarantee that the results would be the same for a 
different type of calculation or with different operators. Consequently, 
if the object were to find out which machine is the speedier in general 
use in a computing laboratory, the experiment that was made has only 
scratched the surface of the problem. 


1.23 The Experimental Treatments. We have used the term treat- 
ments to denote the different procedures whose effects are to be measured 
and eompared. In the selection of treatments it is important to define 
clearly each treatment and to understand the role that each treatment 
will play in reaching the objectives of the experiment. Some issues that 
arise in particular cases are as follows. 

1. Confusion is sometimes caused by failure to distinguish whether 
the object is merely to "spot the winner" among the different treatments, 
or whether in addition it is desired to find some clues as to why the 
treatments behave as they do. A good example is the experiment which 
demonstrates, or so we are informed, that each of the three treatments, 
whiskey and water, gin and water, and rum and water, taken orally in 
sufficient quantities, produces some degree of intoxication. By itself the 
experiment provides no information as to whether the intoxication is due 
to the water, to the other ingredient, or to the fact that the two are 
níixed. A more extensive experiment with additional treatments would 
be necessary to throw some light on this question. Although there are 
occasions when it is sufficient simply to discover which is the best of the 
treatments, experience suggests that even in strictly applied research 
progress is faster if experiments supply fundamental knowledge. Sim- 
ilarly, the criticism that a certain treatment should be omitted because 
it would not be used in practice is valid if the aim is to find the best of 
the “practical” treatments, but it is not valid if the “impractical” 
treatment will supply some needed information about the behavior of 
other treatments in the experiment. 

2. The specification of the treatments may raise difficult questions 
about the conditions under which the treatments are to be compared. 
Suppose that the object is to find out the effect of an application of 
sulphate of ammonia on the yield of some crop. It is well known that 
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this effect is likely to depend on the amounts of other important plant 
nutrients that are available to the crop. Consequently, it must be 
decided whether to add these nutrients so that they will be in abundant 
supply on all plots, or to test the nitrogen on the soil as it happens to be. 
The decision must be guided by the objectives of the experiment, and 
more particularly by the type of population to which inferences are to 
apply. Sometimes it is advisable to test the nitrogen in both the pres- 
ence and the absence of the other nutrients. One result of this decision 
will be that the experiment becomes also a test of the other nutrients, 
even though this may not have been part of the original plan. Experi- 
ments of this type, where one factor is tested over different levels of 
another factor, are known as factorial experiments and have come to play 
a prominent part in experimentation. They are discussed more fully in 
chapter 5. 

3. In some cases it becomes apparent that the treatments that can 
be tested in practice are not those that we would like to test. The follow- 
ing example is typical of many fields where human relations are involved. 
Suppose that it is desired to compare two methods A and B of teaching a 
foreign language, and that we have defined each method clearly and 
have decided how to measure the success achieved by each method. 
However, some of the teachers who are to participate in the test already 
use a method similar to method A and have strong beliefs that method 
B is inferior, whereas others already follow method B and have no use 
for method A. If the teachers are divided at random into the groups 
that are to teach the two methods, we may expect that some teachers 
will be assigned to teach a method in which they have no faith, and we 
may decide that this is not the kind of comparison that we had in mind. 
On the other hand, if we forsake randomization and allow teachers to 
use the method that they like, differences attributed to the methods may 
in fact be due to differences in the ability of the two groups of teachers. 
Given enough resources, the situation can be more thoroughly explored 
by having each method taught by those who like it, those who do not, 
and those who are neutral, so that there are six treatments instead of 
two. In practice, the resources may not permit this elaboration, and 
the issues that must be-faced are how best to use the resources available, 
and whether the experiment, as it can be done, is worth doing. Here 
again, the deciding factor is usually the use to which the results are to be 
put. 

4. Discussion may arise as to the need for a “control,” this term being 
used rather loosely for a treatment in which we are not particularly in- 
terested, but which may be needed in order to reveal, by comparison, 
whether other treatments are effective. Suppose that we wish to com- 
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pare the effectiveness of three treatments that are qualitatively similar, 
for example, three nitrogenous fertilizers all of which supply the same 
amount of nitrogen. The control would be a “‘no-nitrogen” treatment. 
Three cases may be distinguished. (i) The effectiveness of this type of 
treatment has been consistently demonstrated previously, and it ге- 
mains to discover which of the three qualities is best. There is no need 
for a control. (ii) The type of treatment is in general effective, but oc- 
савіопаПу the conditions of the test are such that it is not. For ex- 
ample, nitrogenous fertilizers may fail to produce responses in fields 
where the soil fertility is very high. In this case it will be well to add a 
control, which serves primarily to describe the conditions under which 
the fertilizers were tested. (iii) It may not be known whether the type 
of treatment is effective. The control should of course be included, and 
there may be a case for giving it more replication than the other treat- 
ments receive. An experiment of this type is analyzed in chapter 3, 
where the treatments were 4 types of soil fumigant, and the control (no 
fumigation) was replicated 16 times as against 4 for the other treatments. 
The effect is to obtain a more accurate estimate of the average response 
to fumigants, at the expense of some loss in accuracy in the comparisons 
among the fumigants. 

An interesting example of the illumination that is sometimes produced 
by the inclusion of a control has been reported by Jellinek (1.3). A 
headache remedy contained 3 ingredients, a, b, and c. In order to test 
whether ingredients b and с were necessary, the complete mixture (abc) 
was compared with (ac) and (ab). There were 199 subjects, each of 
whom was treated with each drug for a two-week period, the appropriate 
drug being given whenever the subject complained of a headache. Suc- 
cess was measured by the ratio of the number of headaches relieved to 
the number tested in the two-week period. The mean success rates were 
0.84 for (abe), 0.80 for (ac), and 0.80 for (ab); these figures show very 
little overall difference in effectiveness. 

The experiment also contained a “control”—an inert mixture which 
looked the same as the others but had no active ingredients. This was 
tested under the same conditions as the 3 drugs. No less than 120 sub- 
jects, or about 60%, reported at least some of their headaches relieved 
by the control, a result that is itself of some interest. Further, the 
control enables us to isolate the group of 79 subjects whose headaches 
were not relieved by an inert mixture. For this group, the mean success 
rates were 0.88 for (abc), 0.67 for (ac), and 0.77 for (ab), and could be 
shown to be significantly different. As the author comments: “Banal as 
it may sound, discrimination among remedies for pain can be made only 
by subjects who have a pain on which the analgesic action can be tested.” 
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И а control is required, it must be an integral part of the experiment 
so that results for the control are directly comparable with those for the 
other treatments. This point tends to be overlooked in experiments 
with human subjects when it is difficult or troublesome to assemble the 
desired number of subjects. For example, if a new drug is to be tested 
in some ward of a hospital, the recovery rate in the ward before the drug 
was introduced is not a satisfactory control, nor is the recovery rate in a 
different ward where patients happen to be receiving the standard drug. 
An observed difference between the efiects of the new and the standard 
drug might be due to differences in the severity of the disease or in the 
type of patient or in other aspects of the medical care in the two time- 
periods or the two wards. It is necessary to regard the new and the 
standard drug as two experimental treatments on an equal footing, and 
to use randomization, or one of the methods of restricted randomization, 
in assigning the two drugs to patients. 


1.24 Further Details. Other features of the experiment that should 
be included in the draft of the proposals are the number of replications, 
the types of experimental material to be used, and the measurements 
that are to be made. Since these primarily affect the accuracy of the 
experiment, discussion of them is deferred to chapter 2, which is devoted 
to the general question of accuracy. 

Finally, the draft should describe in some detail the proposed method 
for drawing conclusions from the results. This is the section that is 
most frequently omitted, though it isa valuable one. It may include a 
sketch of the analysis of variance, an indication of the tabular form in 
which results will be shown, and some account of the tests of significance 
to be made and the treatment differences that are to be estimated. In 
this process we verify which treatment comparisons are relevant to 
each of the stated objectives of the experiment. Attention is drawn to 
deficiencies in the set of treatments and to treatments that supply no 
information essential to the purpose of the experiment. 
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CHAPTER 2 


Methods for Increasing 
the Accuracy of Experiments 


2.1 Introduction 


The results of experiments are affected not only by the action of the 
treatments, but also by extraneous variations which tend to mask the 
effects of the treatments. The term experimental errors is often applied 
to these variations, where the word errors is not synonymous with 
"mistakes," but includes all types of extraneous variation. Two main 
sources of experimental errors may be distinguished. The first is in- 
herent variability in the experimental material to which the treatments 
are applied. We shall use the term experimental unit to denote the group 
of material to which a treatment is applied in a single trial of the ex- 
periment. The unit may be a plot of land, a patient in a hospital, or a 
lump of dough, or it may be a group of pigs in a pen, or a batch of seed. 
It is characteristic of such units that they produce different results even 
when subjected to the same treatment: these differences, whether large 
or small, contribute to the experimental errors. The second source of 
variability is lack of uniformity in the physical conduct of the experi- 
ment, or in other words, failure to standardize the experimental tech- 
nique. 

Neither the presence of experimental errors nor their causes need 
concern the investigator, provided that his results are sufficiently ac- 
curate to permit definite conclusions to be reached. In many fields of 
research, however, with the time and labor that can be given to an ех- 
periment, the results are so greatly influenced by experimental errors 
that only treatment differences that are large can be detected, and even 
these may be subject to considerable uncertainty. As a consequence, 
the investigation of methods for increasing the accuracy of experiments 
has played a prominent part in experimental research in recent years. 

The methods may be classified broadly into three types. The first is 
to increase the size of the experiment, either through the provision of 
more replicates or by the inclusion of additional treatments. The second 
is to refine the experimental technique. Thirdly, we may handle the 
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experimental material so that the effects of variability are reduced. 
This may be done by careful selection of the material, by taking ad- 
ditional measurements that provide information about the material, or 
finally by skillful grouping of the experimental units in such a way that 
the units to which one treatment is applied are closely comparable with 
those to which another treatment is applied. These various methods аге 
discussed in more detail in succeeding sections. 

At this point it is advisable to discuss briefly the concepts accuracy and 
precision. Although these words are regarded as roughly synonymous 
in most dictionaries, they have sometimes been assigned technical mean- 
ings, particularly in the natural sciences and engineering. The difference 
between the concepts may be illustrated by an experiment in which 
children are weighed оп a machine that has a bias of 1 lb. That is, if the 
vrue weight of a child is 46 Ib., repeated weighings of the child give read- 
ings which vary around 47 lb. The accuracy of a measurement of the 
weight of the child signifies the closeness with which the measurement 
approaches the true weight, 46 Ib. The term precision, on the other 
hand, is concerned merely with the repeatability of the measurements. 
Thus the precision of a measurement denotes the closeness with which 
the measurement approaches the average, 47 lb., of a long series of 
measurements made under similar conditions. It is clear that, if the 
bias is large, a measurement may be of high precision but of low ac- 
curacy.* 

Most of the devices mentioned in this chapter, including replication, 
additional measurements, and skillful grouping of the experimental ma- 
terial, operate on the precision of the experiment. In so far as the 
method of measurement remains biased, these procedures do not affect 
this bias. A refinement in technique may reduce it. Other things being 
equal, an increase in precision is accompanied by an increase in ac- 
curacy, though if a large bias is present a substantial increase in pre- 
cision may result in a trivial increase in accuracy. 

: For simplicity in presentation, we shall sometimes speak as if no bias 
18 present, that is, as if precision and accuracy are identical. Thus the 
phrase “true difference between two treatments” will mean the true 
difference as recorded by the measuring device that was actually used. 
This will be identical with the natural concept of a “true difference” 
only if the measurements are not biased in favor of one of the treatments. 


* For a discussion of the difficult problem of measuring precision and accuracy 
вее Shewhart (2.8). ni 


221 NUMBER OF REPLICATIONS FOR TESTS OF SIGNIFICANCE 17 


2.2 Number of Replications 


Whatever the source of the experimental errors, replication of the 
experiment steadily decreases the error associated with the difference 
between the average results for two treatments, provided that pre- 
cautions (such as randomization) have been taken to ensure that one 
treatment is no more likely to be favored in any replicate than another, 
so that the errors affecting any treatment tend to cancel out as the 
number of replications is increased. The rate at which the experimental 
error is reduced is predictable from statistical theory. The basic quan- 
tity used to measure experimental errors is the error variance per ex- 
perimental unit, which is defined as the expected value of the square 
of the error that affects the observation for a single experimental unit. 
The square root of this quantity is called the standard error per unit. If 
c? is the error variance per unit and there are г replications, the error 
variance of the difference between the means for two treatments is 
27? /т, and the corresponding standard error is М202/ Vr. This result 
is valid unless increased replication necessitates the use of less homo- 
geneous experimental material or of a less careful technique. 

In succeeding sections some advice is given on the choice of the number 
of replications for an experiment. To present realistic advice is not 
easy. Often the size of an experiment is limited by lack of resources 
or by the conflicting claims of other experiments. However, even if it is 
realized that an experiment must fall short of the precision desired, it is a 
good practice to try to estimate the degree of precision that will be at- 
tained and to present this information as part of the proposal for the 
experiment. 


2.21 Number of Replications for Tests of Significance. Consider the 
difference between the effects of a pair of treatments, which might be a 
standard procedure and some new procedure which it is hoped will prove 
better than the standard. The precision desired in an experiment may 
be specified in either of two ways. We may specify the size of the true 
difference which the experiment is to detect by means of a test of sig- 
nificance, or we may specify how closely we wish to estimate the true dif- 
ference, by stating the width of the confidence interval that we would 
like to have for the true difference. 

Although the specification used is to some extent a matter of individual 
taste, the approach by means of a test of significance is helpful mainly 
in the initial stages of a line of research. The reasoning in this approach 
is roughly as follows. If it can be established that the new procedure is 
superior to the standard procedure by at least some stated amount, say 
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20% of their mean, then we will have discovered a useful result. On 
the other hand, if the first experiment shows no significant difference 
between fhe two treatments, we will be discouraged from further in- 
vestigation of the new treatment. Consequently, the first experiment 
must be large enough to ensure that if the true difference is 20% or more, 
it is highly probable that we will obtain a significant difference. Note 
that in this approach we do not insist that the first experiment give a 
precise estimate of the true difference, but merely that it give a signifi- 
cant result at the chosen level of significance. 

The probability of obtaining a significant result depends on the true 
standard error т per unit, the number of replications 7, and the number 
of degrees of freedom n that the experiment provides for estimating the 
error variance. The exact calculation of the probability is rather com- 
plicated; tables have been given by Neyman et al. (2.1) and others. The 
argument used in the following example, though logically faulty, leads 
to an approximation that is good enough for most practical purposes. 

Example. The true standard error per unit is 12% of the mean of the 
experiment, and there are 8 replicates and 28 d.f. for the estimate of 
error variance. What is the probability of obtaining a significant result 
when the true difference is 20%? 

Let d be the observed difference and ô the true difference between the 
mean of the new treatment and that of the standard treatment. The 
true standard error of d is 


2? Ra 
М кш нев = 6% 


In practice we do not know c, and in Из place we use the estimated 
standard error per unit s, obtained from the analysis of variance of the 
results. We will assume that s happens to be equal to c. 

The test of significance of d is made by means of a Student t-test, with 
28 d.f. To find the value that d must attain in order to be significant, we 
multiply the standard error of d, 6%, by the significant value of t. 
Suppose that the test is made at the 5% significance level, and that it is 
a one-tailed безі.“ The 10% value of ¢ in the standard tables, for 

* A one-tailed test is used when it is known that the new treatment must be at 
least as good as the standard procedure. The difference d is declared significant 
only if d is sufficiently large and positive; if d is negative we ascribe the result to ex- 
perimental errors, since we do not entertain the possibility that the new treatment 
could be inferior. А two-tailed test is made when we do not know which treatment 
is better; large values of d, whether positive or negative, are declared significant. 
All standard tables of the significance levels of t are computed for two-tailed tests. 
For a one-tailed test we read the value of t corresponding to twice the significance 
probability. 4 
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28 d.f. is 1.701, so that if d is to be significant we must have 


2 
а> "m = (6)(1.701) = 10.206 
T 


The problem is to find the probability that this happens. 
The quantity 


follows the ¢-distribution with 28 d.f. But it is evident that 


| 4-90. 10.206 — 20 
d> 10.206 if and only if 4---->---- = —1.632 


Hence the probability which we зееК is the probability that a value of 
1, with 28 d.f., should exceed —1.632. In the table of t, the probability 
that a value lies outside the limits +1.632 is about .11. The proba- 
bility wanted is therefore [1 — (14)(0.11)], or about .945. It is highly 
likely that a significant difference would be found in this experiment. 

The flaw in the argument is the assumption that the value of 8 can 
be assumed equal to e, whereas s is actually a random variable. If the 
argument is carried through algebraically, it gives the following rule for 
calculating the approximate probability P of obtaining a significant re- 
sult. 

Step 1. Find % from the relation 


bw Pets + t) (2.1) 
Е 


where 5 = true difference that it is desired to detect. 
с = true standard error per unit. 
r = number of replications. 
tı = significant value of ¢ in the test of significance (with care to 
distinguish between one-tailed and two-tailed tests). 
Step 2. 
Р=1- Q9» (2.2) 


where рә is the probability corresponding to t in the ordinary t-table. 
The degrees of freedom in f, and tz are those available for the estimate of 
error variance. 
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TABLE 2.1 NuMBER OF REPLICATIONS REQUIRED FOR A GIVEN PROBABILITY ОР 
OBTAINING A SIGNIFICANT RESULT 


Upper figure: Test of significance at 5% level, probability 80% 
Middle figure: Test of significance at 5% level, probability 90% 
Lower figure: Test of significance at 1% level, probability 95% 


One-tailed tests 
True differ- True standard error per unit (о) as percent of the mean 
ence (5) as 
percent of 
the mean | 2 | 3 4| 5 6|7 8|9 10/11 12] 14 16/18 20 
+ vam Ч 
5 8| 6 9/13 19 | 25 33] 41 50 
4| 7 12118 26 | 35 45 
7 | 13 22 | 33 47 
10 212 3] 4 6| 7 9|11 13 |16 19] 25 33] 41 50 
213 415 8| 9 12 |15 18 |22 26| 35 45 
3 7| 9 13 |17 22|27 33/40 47 
15 2) 2 2) 3 38) 4 5| 6 7] 8 9 12 15|19 23 
212334567 9/10 12] 16 21/26 31 
21345 7| 8 10/13 15 18 22] 29 37 47 
20 212,21% 21331445 бгу 1 13 
2122233455679 12| 15 18 | 
2) 3 8) 4 4] 5 7) в 9|11 13| 17 22] 27 33 | 
25 2) 2 2-0” 2) OP ш! a «| 5 6| 7 9 
2|2 2) 2021853) 3 415 5| 7 в|ю 12 М 
21323334567 зо п 14|18 22 | 
0 |212 212 2/2 2/2 313314 5/67 
В а БАНА ШУ" 
2| 272198 31-3. О 6 7| 8 10/18 15 


The rule may be inverted in order to give the number of replications 
т required for a given probability P of obtaining a signifieant result. In 
this form we have 


r> 2 (2), + (23) 


where ô = true difference that it is desired to detect. 
7 = true standard error per unit, 
й = significant value of 2 in the test of significance. 
t = value of і in the ordinary table cr sresponding to 2(1 — P). 
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TABLE 2.1 (Continued) 
Two-tailed tests 


True differ- True standard error per unit (о) as percent of the mean 
ence (5) аз 
percent of 
the mean | 2/3 4/5 6|7 8 [9 10/11 12/14 16) 18 20 
5 4| 7 1117 24) 32 41 
5| 9 15 |22 31 | 42 
7 | 14 24 | 38 
10 218 415 7| 9 11] 14 17] 20 24) 32 41 
213 5| 7 9) 12 15118 22 | 27 31 | 42 
3| 5 7|1 14] 19 24] 30 37 | 45 
15 212 334567 810 11/15 19/24 29 
2123456171 9 11] 18 15 |19 25 |81 39 
3| 3 4| 6 7] 9 12) 14 17] 21 24 |88 42 
20 ‚А ^2 | Ж шылы Sa Sa ee À 
212 2| 8 3] 4 5| 6 7| 8 912 15|18 22 
213 3| 4 516 7| 9 11) 12 14119 24,30 37 
25 2| 2 2| 2 2] 8$ 8| i$ ше ого Т A 
2| 278) 9878 PrEP Б" 9781303 272715 
2| 2 313 4| 5 t|6 7| 9 0 13 16| 20 24 
80 212 2) 2 220 84.8, 4 -%],6 16.7, 8 
2| 272,2 | x rdi 2: ecu 304: © Ca onde 
2| 2 8|"8''9apoum Ebr gie Wwe 12116 "TY 


Notes. In constructing the table, it was assumed that the number of degrees of 
freedom for error is 3(r — 1); this would apply in а randomized blocks experiment 
with 4 treatments. 

No entries are given where more than 50 replications are required. 


The use of this equation is slightly tricky, because the number of 
degrees of freedom in 4 and tz depends on т. Trial and error may be used 
until the smallest r satisfying (2.3) is found. 

Example. In the previous example, how many replications are re- 
quired in order to have a chance of 4 in 5 that a significant result will be 
obtained? Неге we have ô = 20, с = 12, and we will suppose that the 
experiment has 4 treatments in randomized blocks, which gives n — 
3(r — 1) degrees of freedom for the estimate of error. In the t- table, 
1, is the value for probability .10, while & is the value for pri 

(1 — .80) or .40. 


B.C.E.R. [5 West Benga 


wn т 
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In order to start the calculation some guess must be made about the 
value of n. It does not matter greatly if the guess is inaccurate; we will 
try n = 30. For this, 


4 = 1.697; t2 = .854; sothat (l + t) = 2.551 


Hence 
r > 2(1269)?(2.551)? = 4.7 


Consequently our second trial value is r = 5. Since this provides only 
12 instead of 30 d.f., we must repeat the calculation to verify whether 
5 replications are sufficient. For 12 d.f. (и + tə) is 2.655. Thus 


2 (Je +h)? =2 ЕНЕ - 5.08 


Since this is greater than 5, we conclude that 5 replications are not quite 
enough, and that 6 is the smallest number of replications that will 
satisfy the conditions. We can be sure without further computation 
that 6 will satisfy the conditions, because the right-hand side of the in- 
equality always decreases when we increase 7. 

Table 2.1, computed by this method, shows the minimum number of 
replications for a range of values of 6 and с, and is intended to be used 
in the planning of experiments. Usually it is most convenient to think of 
6 and c expressed in percentages of the mean per observation for the 
experiment, and the range in the table has been constructed with this 
in mind. Since the value of r depends only on the ratio c/2, it does not 
matter what units are used for ? and с, provided that they are the same 
for both. There are three entries for any given pair of values of à and c. 
These show, in descending order, the replications required for a test of 
significance at the 5% level and a probability .80 of getting a significant 
result, for a 5% level and probability .90, and for a 1% level and prob- 
ability .95. The top half of the table applies to one-tailed tests, the 
lower half to two-tailed tests, 

Construction of the table Tequires some assumption about the size of 
the experiment. In the case presented the experiment has four treat- 
ments in randomized blocks, so that n = 3(r — 1). The values in the 
table change little if the experiment happens to provide more than this. 
If the experiment has only two or three treatments it is advisable to 
check the value of r by means of the formula. 

In the practical use of the table it is necessary to estimate the value of 
the true standard error perunit. Often this can be done fairly well from 


| 
| 


| 
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the results of previous experiments with the same kind of material. It 
should be realized that the probabilities to which the table applies will 
be correct only if the value of с is estimated correctly. If с is under- 
estimated, the probability of obtaining a significant result will of course 
be smaller than the presumed value. 

As an illustration of the reading of the table, consider the case in the 
last example, where 6 = 20, о = 12, P = .80, and the test is a one-tailed 
test at the 5% level. For this we read the top entry in the top half of 
the table for 5 = 20, с = 12, and find that 6 replications are necessary. 
From the middle figure we see that an increase to 7 replications in- 
creases the probability to .90. If the test is to be two-tailed, we read at 
the corresponding place in the lower half of the table, and find that 7 
replications are required for a probability .80 and 9 for a probability .90. 

The table brings out the value of any reduction in the standard error 
per unit. In fact, we cannot have a high probability of detecting a 5% 
difference with any reasonable number of replications unless the stand- 
ard error per unit is 4% or under. Differences of 20% or over can be 
detected in most cases without excessive replication. No entries have 
been inserted in the table for situations where more than 50 replications 
would be required, since experiments of this size are uncommon. 


2.21а Number of Replications for Data Arranged in Two Classes. 
Table 2.1a gives the number of replications required in experiments 
where the experimental units are classified into two classes, e.g., “Alive” 
or “Dead,” “Healthy” or “Diseased,” “Improved” ог “Not improved.” 
We are interested in comparing the proportions or percentages that fall 
into one of the classes under two different treatments. 

Example 1. A standard drug for treating some ailment has been suc- 
cessful in about 30% of the patients. A new drug is to be compared 
with the standard drug in an experiment. If the new drug has a true 
success rate as high as 40%, the physician would like to be fairly certain 
to obtain a significant result in the experiment. The test of significance 
of the difference in success rates is to be a two-tailed test at the 5% 
level, and the physician is content with a probability of .80 that a sig- 
nificant difference will be found. How many patients does he need? 

Enter the table for two-tailed tests with 


P, = smaller percentage of success = 30 
5 = size of difference to be detected 
= larger percentage of success — smaller percentage of success 


= 40 — 30 = 10 
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TABLE 2Ла NuMBER OF REPLICATIONS (WITH DATA ANALYZED IN TWO CLASSES) 
REQUIRED FOR A GIVEN PROBABILITY OF OBTAINING A SIGNIFICANT RESULT 


Upper figure: Test of significance at 5% level, probability 80% 
Middle figure: Test of significance at 5% level, probability 90% 
Lower figure: Test of significance at 1% level, probability 95% 


One-tailed tests 
Р; = 6 = Pa — Pj = larger minus smaller percentage of success 
smaller 
% 


success 5 10 | 15 | 20 | 25 | 30 | 35 | 40 | 45 | 50 | 55 | 60 | 65 | 70 


5 330 | 105 | 55| 35| 25/20|16|13/11/ 9| 8| 7| 6! 6 
460 | 145 | 76| 48| 34|26|21|17|15|13|11| 9| 8| 7 
850 | 270 | 140 | 89| 63 | 47 | 37 | 30 | 25 | 21 | 19 | 17 | 14 | 13 


10 540 | 155 | 76 | 47| 32 | 23 | 19 | 15 | 13111] 9| 8| 7| 6 


15 710 | 200 | 94| 56| 38| 27 | 21| 17| 14 | 12| 10| 8| 7| 6 


25 980 | 260 | 120 | 69 | 45 | 32 | 24 | 19 | 15| 12 | 10| 8| 7 


30 1080 | 280 | 130 | 73 | 47 | 33 | 24 | 19 | 15 | 12 | 10| 8 


35 1160 | 300 | 135 | 75| 48 | 33 | 24 | 19 | 15 | 12 | 9 


2960 | 750 | 340 | 190 | 125 | 85 | 61 | 46 | 35 | 27 | 21 


40 1210 | 310 | 135 | 76 | 48 33 | 24 | 18 | 14 | 11 


45 1230 | 310 | 135 | 75 | 47 | 32 | 22 | 17 | 13 
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TABLE 2.1а (Continued) 
Two-tailed tests 
Р = 5 = Pz — P, = larger minus smaller percentage of success 
smaller 
% 
success 5 10 | 15 | 20 | 25 | 30 | 35 | 40 | 45 | 50 | 55 | 60 | 65 | 70 
5 420 | 130 | 69| 44| 31 | 24 | 20 | 16 | 14 | 12| 101997 
570 | 175.| 93| 59| 42 | 32 | 25 | 21| 18 | 15 | 13 | 11 | 10 | 9 
960 | 300 | 155 | 100 | 71 | 54 | 42 | 34 | 28 | 24 | 21 | 19 | 16 | 14 
10 680 | 195 | 96 | 59| 41 | 30 | 23 | 19 | 16 | 13/11/10, 9| 7 
910 | 260 | 130 | 79| 54 | 40 | 31 | 24 | 21 | 18 | 15 | 13 | 11 | 10 
1550 | 440 | 220 | 135 | 92 | 68 | 52 | 41 | 34 | 28 | 23 | 21 | 18 | 15 
15 910 | 250 | 120 | 71| 48 | 34 | 26 | 21 | 17 | 14 | 12 | 10| 9| 8 
1220 | 330 | 160 | 95| 64 | 46 | 35 | 27 | 22 | 19 | 16 | 13 | 11 | 10 
2060 | 560 | 270 | 160 | 110 | 78 | 59 | 47 | 37 | 31 | 25 | 21 | 19 | 16 
20 1090 | 290 | 135 | 80 | 53 | 38 | 28 | 22 | 18 | 15 | 13 10, 9| 7 
1460 | 390 | 185 | 105 | 71 | 51 | 38 | 29 | 23 | 20 | 16 | 14 | 11 | 10 
2470 | 660 | 310 | 180 | 120 | 86 | 64 | 50 | 40 | 32 | 26 | 21 | 19 | 15 
25 1250 | 330 | 150 | 88 | 57 | 40 | 30 | 23 | 19 | 15 | 13 110 | 9|.. 
1680 | 440 | 200 | 115 | 77 | 54 | 40 | 31 | 24 | 20 | 16 | 13 | 11 | .. 
2840 | 740 | 340 | 200 | 130 | 92 | 68 | 52 | 41 | 32 | 26 | 21 | 18 
30 1380 | 360 | 160 | 93| 60 | 42 | 31 | 23 | 19 | 15 | 12 | 10 > 
1840 | 480 | 220 | 125 | 80 | 56 | 41 | 31 | 24 | 20 | 16 | 13 В 
3120 | 810 | 370 | 210 | 135 | 95 | 69 | 53 | 41 | 32 | 25 | 21 
35 1470 | 380 | 170 | 96 | 61 | 42 | 31 | 23 | 18 | 14 | 11 
1970 | 500 | 225 | 130 | 82 | 57 | 41 | 31 | 23 | 19 | 15 
3340 | 850 | 380 | 215 | 140 | 96 | 69 | 52 | 40 | 31 | 23 
40 1530 | 390 | 175 | 97| 61 | 42 | 30 | 22 | 17 | 13 
2050 | 520 | 230 | 130 | 82 | 56 | 40 | 29 | 22 | 18 
3480 | 880 | 390 | 220 | 140 | 95 | 68 | 50 | 37 28 
45 1560 | 390 | 175 | 96 | 60 | 40 | 28 | 21 | 16 
2100 | 520 | 230 | 130 | 80 | 54 | 38 | 27 | 21 
3550 | 890 | 390 | 215 | 135 | 92 | 64 | 47 | 34 
50 1560 | 390 | 170 | 93| 57 | 38 | 26 | 19 ` 
2100 | 520 | 225 | 125 | 77 | 51 | 35 | 24 + 
3550 | 880 | 380 | 210 | 130 | 86 | 59 | 41 ve 


26 METHODS FOR INCREASING ACCURACY 2.214 


Read the upper line, which corresponds to a 5% test with .80 proba- 
bility. This gives 
т = 360 


This is the size of each sample, so that 720 patients will be needed. 

Example 2. Тһе physician reports that he will be able to find at 
most 200 suitable patients in a reasonable length of time. What dis- 
criminating power does an experiment of this size have? 

Here we are given P, = 30, r = 100 (in each group), and want to find 
8. Looking further along the row in which P, = 30, we see that when 
$ = 20, г = 93, the reading nearest to r = 100. With à = 20, P, = 
30 + 20 = 50. An experiment with 200 patients has 4 chances in 5 of 
finding a significant result only if the new treatment gives a success rate 
close to 50%. The physician must decide whether an experiment with 
this degree of ability to discriminate is worth doing. 

As the examples show, use of the table necessitates an estimate of the 
percent of success expected for one of the treatments. If this treat- 
ment has been employed for some time, the estimate can be made from 
past experience. We must also insert the size of true difference 5 that 
it is desired to detect. It is sometimes difficult for the experimenter to 
decide what value of 6 is of practical importance to him. Several values 
can be tried; inspection of the necessary sample sizes may aid in reach- 
ing a decision. 

In the table, values of P; are given only up to 50%. If the success 
rates to be compared both exceed 50%, the table is entered in terms of 
failure rates. 

Example 3. A standard treatment gives 75% success and it is desired 
to “detect” a new treatment that has 85% success. For a one-tailed 5% 
test with probability 90%, we take 


P, = smaller percentage of failure = 15 
= size of difference = 85 — 75 = 10 


т = 270 in each sample (from the middle line 
of the one-tailed table) 


The sample sizes required in table 2.1а often come as an unpleasant 
shock. To detect a difference of 5% in success rates usually requires а 
total sample size running into the thousands. Small sample sizes will 
detect only gross differences. 

Table 2.1a assumes that the experimental units have been assigned at 
random to the two treatments. If the units have been grouped into 
pairs, such that the two members of a pair are highly comparable (ее 
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section 4.21), the required values of r are smaller than those read from 
the table. The reduction is not great unless the pairing is markedly 
effective. 

The table was computed from an approximation given in (2.1а). 


z> (Ki + K3)* 
2(sin^! 4/P, — sin™ МР»)? 


where Кі and Ko are the normal deviates corresponding to the sig- 
nificance levels о (the level of the test of significance) and 2(1 — P), 
respectively, P being the desired probability of obtaining a significant 
result. Values of r less than 30 were inserted by a computer program 
for the exact power, developed and kindly provided by M. H. Gail and 
J. J. Gart. The approximation was fairly consistently too low by 1 
unit in this region. Computed values of r up to 20 were therefore all 
raised by 1. 


2.22 Number of Replications for Prescribed Limits of Error. The 
confidence limits for 6, the true difference between the effects of two 


treatments, are 
2 
б=а- ,|-st (2.4) 
Y 


where 1 is the value in the /-table corresponding to the confidence prob- 
ability chosen and to the number of degrees of freedom n in the estimate 
of error. The quantity +/2st/+/r may be called the limit of error, since 
it measures the maximum distance between d and 6 for the confidence 
probability chosen. Since the limit of error depends on s, it is not known 
until the experiment is completed. If s is replaced by its average or 
expected value, we obtain the expected limit of error 


L= ^ fot (2.5) 
7 


The factor / enters because the average value of s is slightly less than о; 


1 
to a close approximation, f equals (% - =) ^ 
n, 


From relation (2.5) table 2.2 was constructed to show the numbers of 
replications required for various values of L and for confidence probabili- 
ties of .80, 90, and .95. As in table 2.1, the experiment was assumed to 
have 3(r — 1) degrees of freedom in the estimate of error; no entries were 


28 


‘TABLE 2.2 


Limit of 
error (+L) 
as percent 
of the mean 


10 


inserted where more than 50 replications are necessary, and the values | 
of L and c are expressed as percentages of the mean for the experiment. | 
This table may be used in the planning of an experiment where the 
object is to estimate the difference in the effects of two treatments. Sup- 
pose that the true standard error per unit is expected to be about 6%, | 
and that preliminary experiments have indicated that the true difference | 
between the effects of the two treatments is around 25%. In this case 
we would probably not be satisfied with a value of L any higher than 
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2.22 


NUMBER OF REPLICATIONS REQUIRED FOR GIVEN LIMITS OF ERROR IN 


THE DIFFERENCE BETWEEN TWO TREATMENTS 


Upper figure: Confidence probability .80 
Middle figure: Confidence probability .90 


Lower figure: Confidence probability .95 


True standard error per unit (о) as percent of the mean 
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5%; if L is 5% and if the observed difference is found to be 22%, we 
are confident that the true difference lies between (22 + 5)%, that is, be- 


| 
11 
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tween 17 and 27%. From the entry for о = 6, L = 5, we find that 6 
replications suffice for a confidence probability of 80%, 9 for 90%, and 
12 for 95%. 

It is important to reiterate that this table does not guarantee that the 
limits of error will be of the desired size, even if the value of c is fore- 
casted correctly. The actual limits will depend on s and may be either 
larger or smaller than L. They will be about the desired size on the 
average if в is guessed correctly. For methods giving more definite 
guarantees see (2.3) and (2.4.) Furthermore, the table makes it clear 
that small treatment differences of about 5% cannot be accurately 
estimated in a single experiment, at least with the values of с and r that 
normally prevail in many lines of research. For an accurate estimate of 
a 5% true difference, we would probably not want L any larger than 1%. 
This value was not included in the table since it demands great numbers 
of replications. 

With data arranged in two classes, an approximate formula for r, the 
number of replicates for each treatment, is 


а K*(P1Qi + P2Q2) 

"uS NE 
where K is the normal deviate corresponding to the chosen confidence 
probability and Q = 1 — P. 


2.23а Sequential Experimentation. Although this term does not have 
a precise technical meaning, it is used generally for experiments in which 
(1) the treatments are applied to the experimental units in some definite 
time-sequence, and (ii) the process of measurement is rapid, so that the 
yield or response on any unit is known before the experimenter treats 
the next unit in the time-sequence. А typical example is the experiment 
described at the beginning of this book (section 1.11). In this, 10 calcu- 
lations were done by a single computer on each of two desk computing 
machines in order to estimate the difference in speed between the two 
machines. The 20 calculations were performed one after the other in a 
definite order, and the time taken for any calculation was known before 
the next calculation was started. 

When an experiment is sequential, the experimenter can stop after 
every observation and examine the accumulated results to date before 
deciding whether to continue the experiment. In other words, the analy- 
sis can be done sequentially as well as the experiment. It seems plausible 
that a sequential analysis should be more efficient than the method given 
in the preceding sections, in which the number of replications needed is 
estimated in advance and no examination of the results is made until 
this number of replications has been completed. For instance, the ad- 
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vance calculation might call for 20 replications, but the superiority of 
one of the treatments might be fully evident on inspection after six 
replications. 

A general method of sequential analysis was developed by Wald (2.2a). 
This procedure gives rules that determine, after any number of observa- 
tions, whether to terminate or continue the experiment. If termination 
is indicated, the rules indicate the conclusion to be drawn. These rules 
provide the same guarantees of sensitivity that we were seeking in the 
calculations of number of replications in sections 2.21 and 2.21a. For 
instance, with data classified in two classes, the experimenter may choose 
the quantities 


P, = percent success under the standard treatment, 


6 = P3 — P, = degree of superiority of the new treatment 
which it is desired to detect 


A sequential paired experiment can be constructed such that if the new 
treatment actually is superior by ô percent, the probability of obtaining 
a significant difference in favor of the new treatment will be at least 80% 
or 90% or any desired value. 

The chief practical advantage of sequential analysis is that for the 
same guarantee of sensitivity fewer replications are required, on the 
average, than with an experiment of pre-determined size. The extent 
of the saving in replications varies from case to case, but it appears to 
average about 30 to 40%. A reduction of this amount is highly accept- 
able when observations are expensive. 

So far as we have been able to discover, sequential: experimentation 
of this type has not been widely used. Several explanations can be put 
forward. Although the method is some 10 years old, it may not yet be 
well known to experimenters. The available techniques have thus far 
been restricted to experiments with two treatments. Experiments in 
which the observations become known in a strictly-sequential fashion 
may be infrequent. 

With data arranged in two classes, the sequential technique is de- 
scribed іп (2.2а), (2.3a) and (2.4а). Тһе last two references discuss pos- 
sible applications to medical research. "Tables for sequential analysis 
with continuous data (the sequential t-test) are given in (2.5a) and (2.6a). 
In all these cases the data must be paired into separate replications. 
Rushton (2.102) gives a sequential t-test for unpaired samples. 

Sequential methods have also been produced for certain specialized 
types of experiments. An example is the “up and down" or “staircase” 
method of sensitivity testing (2.72, 2.82). The objective is to estimate 
the amount of some stimulus to which 50% of the experimental subjects 
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give a positive response. А series of increasing stimuli, so, so + d, 
зо + 2d, etc., equally spaced on some scale, is selected. The subjects 
are tested in turn, each subject being exposed to a single stimulus. The 
stimulus 8; for the ith subject is determined by the following sequential 
rule. Let the stimulus for the preceding subject be s;1. Then 
Sj 1 + d, if the preceding subject fails to respond 
5; — d, if the preceding subject responds 
This rule automatically concentrates testing in the neighborhood of the 
stimulus level giving 50% response, and makes for efficient estimation. 
Another example is found in the comparison of the lengths of life of 
two pieces of equipment under operating conditions. Goodman (2.92) 
has pointed out the advantages of an experimental plan in which each 
piece of equipment that fails is replaced by a piece of the opposite type. 
In the broadest sense of the term, all experimentation is sequential. 
Before deciding what scientifie questions he will attempt to answer in 
an experiment, the competent worker considers the knowledge and spec- 
ulations gained from his own and others' previous experiments. 


2.3 Other Methods for Increasing Accuracy 


2.31 The Measurement of Relative Efficiency. We have seen that 
the error variance of the difference between the means for two treat- 
ments is 2c?/r, where r is the number of replications. This result pro- 
vides à simple means for comparing, in concrete terms, the relative pre- 
cision of two experiments. Suppose that one design gives a true error 
variance of 1.0 per experimental unit, while a second gives 0.5. The 
two experiments would give equally precise comparisons among the 
treatment means if the amount of replication in the first were twice that 
in the second. From a knowledge of the error variances per unit, com- 
parisons are thus easily made in terms of the relative amounts of replica- 
tion required to attain the same degree of precision. The inverse ratio 
of the variances per unit is sometimes called the relative efficiency of the 
two designs; in the example above, the relative efficiency of the second 
experiment to the first is 2.0 or 200%. Similarly we might speak of the 
relative gain in efficiency as 1.0 or 100% in this case. These terms occur 
frequently in the literature on experimental designs. 

One further factor must be taken into account. For the same numbers 
of treatments and replicates, the number of degrees of freedom in the 
estimate of error changes with the design. This number enters into many 
of the uses to which the experimental results are put. When the number 
of degrees of freedom becomes smaller, the limits of error for a true 
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difference are increased and the probability of obtaining a significant re. 
sult is decreased; in other words, the sensitivity of the experiment is de- 
creased. 

Thus the numbers of degrees of freedom for error are relevant to the 
comparison of two designs. A change in design which decreases the error 
degrees of freedom as well as the error variance may not be advantageous. 
This leads to the question: by how much must the experimental error 
variance be reduced so as to balance a given reduction in the error degrees 
of freedom? The question has been discussed by Neyman et al. (2.1), 
Fisher (2.5), and Walsh (2.6). The answer depends on the use to which 
the results are put. From one point of view, two experiments may be 
regarded as equal in sensitivity when the limits of error for a given con- 
fidence probability are equal. Alternatively, equal sensitivity may 
imply an equal probability of detecting as significant a given real dif- 
ference between the effects of two treatments. Other definitions could 
be set up. 

Table 2.3 illustrates the situation. An experiment with a true error 
variance per unit equal to 1 (i.e., with an infinite number of degrees of 
freedom) was chosen as the standard. The table shows, for a range of 


TABLE 2.3 EFFECT OF NUMBER OF DEGREES OF FREEDOM IN ERROR ON THE BEN- 
SITIVITY OF THE EXPERIMENT. ERROR VARIANCES PER UNIT WHICH GIVE EQUAL 
SENSITIVITY ACCORDING TO FOUR DEFINITIONS 


Number of error degrees of freedom (n) 
Defini- 
tion 
1 2 3 4 5 6 8 10 15 | 20 | 30 | = 
1 0.04 | 0.26 | 0.45 | 0.56 | 0,64 | 0.70 | 0.77 | 0.81 | 0.87 | 0.90 | 0.94 | 1.00 
2 0.27 | 0.59 | 0.72 | 0.79 | 0.83 | 0.86 | 0.90 | 0.92 | 0.94 | 0.96 | 0.97 | 1.00 
3 0.22 | 0.47 | 0.61 | 0.69 | 0.75 | 0.79 | 0.84 | 0.86 | 0.91 | 0.93 | 0.95 | 1.00 
4 0.10 | 0.40 | 0.57 | 0.67 | 0.73 | 0.78 | 0.83 | 0.87 | 0.91 | 0.93 | 0.96 | 1.00 


values of the error degrees of freedom (n), the true error variances per 
unit which are required to equal the standard in sensitivity according 
to four definitions of this term. In case 1 the average limits of error for 
а confidence probability .95 are made equal. In case 2 the same cri- 
terion is used for a confidence probability .80. In case 3 all experiments 
have the same probability .25 of detecting as significant a constant dif- 
ference between two treatments. Case 4 shows the same comparison for 
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a larger constant difference where the probability is .75. The tests of 
significance are one-tailed t-tests at the 5% level. 

For example the figure 0.70 for n = 6 (definition 1) means that with 
a true error variance per unit of 0.70 and 6 d.f. in the estimate of error, 
the limits of error for a confidence probability .95 are the same on the 
average as when c? is 1 and т is infinite. А 30% decrease in error vari- 
ance is needed to compensate for the lack of degrees of freedom. 

The wide divergence between the results for definition 1 and defini- 
tion 2 is noteworthy since the only change in criterion is that from а 
confidence probability .95 to one of .80. Definitions 3 and 4 exhibit 
much closer mutual agreement and give results intermediate between 
those of definitions 1 and 2. In practical applications the discrepancies 
among the four methods will be smaller than table 2.3 suggests, be- 
cause comparable designs for the same experiment seldom differ greatly 
in the degrees of freedom available for error. 

Fisher’s approach (2.5) is somewhat different. In effect, he calculates 
the “amount of information” which the estimated difference d between 
two treatment means supplies about the true difference 5, the calcu- 
lation being made from the ¢distribution of the quantity 


Vr(d —8) 
м 
He finds the information to be (п + 1)/(n + 3)s?, whereas if с were 


known exactly the information would be 1/c?. The estimated variances 
which give equal amounts of information are shown in table 2.4. 


TABLE 2.4 ESTIMATED VARIANCES WHICH PROVIDE EQUAL AMOUNTS OF 
INFORMATION 


Number of error degrees of freedom (n) 


1|2]|]3|4|8|8$|7|8]|9.]|1]|1 |12 | 15 | 309 | 30 | @ 


0.500 | 0.600 | 0.657 | 0.714 | 0.750 | 0.778 | 0.800 | 0.818 | 0.833 | 0.846 | 0.857 | 0.867 | 0.889 | 0.913 | 0.939 1 


'This table agrees closely with the results for cases 3 and 4, except 
when n is below 5. For general purposes it is suggested that this table 
be used to take account of the differences in degrees of freedom for error 
in two designs that are being compared. Suppose that a design with 
т = 6 is compared with one in which п = 12. The former gives more 
information only if the error variance sı? is less than (0.778/0. 867) 82”. 
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Thus the relative efficiency of the first design to the second is estimated 
as 0.778s2"/0.867s;7, or more generally as 

(т + 1)(na + 3)s2? 

(na + 1)(m + 3)s? 
The adjustment is of importance only if nı and тә are small. 

"This expression applies to the simple case in which the two experi- 
ments have the same set of treatments. ‘The experiments may, however, 
have different sets of treatments, although each provides an estimate of 
Some specific treatment effect іп which we are interested. Alternatively, 
а different method of estimation of the treatment effect may have been 
employed in the two experiments. In such cases we may wish to assess 
the relative accuracy with which this treatment effect is estimated in 
the two experiments, The expression given above still applies, except 
that sı? and 3? become the estimated variances of the treatment effect 
as found in the two experiments, while ny and пә are as before the num- 
bers of degrees of freedom in 81? and 822 respectively. 


Іп succeeding sections a number of other devices which affect the | 


accuracy of an experiment are presented briefly. 


2.32 Selection of Treatments. In certain cases the selection of the | 


treatments has a substantial effect on the precision of an experiment. 


As Fisher has stressed, Striking gains in precision may be achieved by | 


testing different types of treatment in the same experiment, instead of 
conducting a separate experiment for each type. For instance, one 
type of treatment or factor might be the depth of ploughing in the 
Preparation of a field for a crop, while another might be the addition of a 
nitrogenous fertilizer. We might conduct one experiment in which 
deep (D) and shallow (S) ploughing are compared, and another in which 
no fertilizing (0) is compared with the addition of a specified amount of 
fertilizer (М). In a factorial experiment both factors would be tested 
simultaneously by means of the 4: "treatments"—(D0), (SO), (DN), and 
(SN), where the symbol (DO) implies deep ploughing with no addition 
of nitrogen, etc. The average response to nitrogen is assessed by com- 
paring the last two treatments with the first two. This comparison 
gives as precise an estimate of the average response to nitrogen as if the 
whole experiment had been devoted to that factor alone. The same 
property holds for the estimation of the average difference in effect be- 


more fully in chapter 5. 
There are many specialized problems where the choice of the proper 


| 
| 
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amounts of some ingredient is important. As a very simple example, 
suppose that the response to increasing amounts of the ingredient is 
known to be linear and that the purpose is to determine the slope of the 
line. The most accurate experiment contains only two amounts of 
ingredient. These should be placed at the ends of the range within which 
the response is linear and experimentation is feasible. With three 
amounts of ingredient, two at the ends and one in the middle, the 
variance of the estimated slope, for the same total size of experiment, is 
114,times as large, and if more amounts are used it becomes larger still. 
A more complex example occurs in the estimation of the amount of a 
virus in a solution by preparing a series of dilutions of the solution and 
counting the numbers of lesions produced on leaves which are rubbed 
with the dilutions (2.7). The accuracy of the estimate is known to 
depend both on the dilution ratio that is chosen and on the number of 
dilutions that are used. 


2.33 Refinements of Technique. Since technique is the responsibility 
of the experimenter, its importance need not be elaborated. The prin- 
cipal objectives of a good technique are as follows. 

i. To secure uniformity in the application of the treatments. In 
pig-feeding experiments, for instance, a uniform amount of food cannot 
be supplied to each animal without the provision of individual feeding 
boxes. In the testing of insect sprays, delicate apparatus is required 
in order to subject each batch of insects to a desired dose of spray. 

ii. To exercise sufficient control over external influences so that 
every treatment produces its effects under comparable and desired соп- 
ditions. It is difficult to generalize about the degree of control needed; 
a balance must be struck between the cost incurred and the gain in 
precision obtained. The artificial production of diseases for experiments 
on resistance to infection exemplifies a case where experimentation can- 
not proceed rapidly without such control over external conditions. 

iii. To devise suitable unbiased measures of the effects of the treat- 
ments. Often the appropriate measurements are readily apparent; 
sometimes the development of a satisfactory method of measurement 
requires years of research, as in the estimation of certain of the vitamins, 
in soil analysis, and in sociological investigations. 

iv. To prevent gross errors, from which no type of experimentation 
seems to be entirely free. Adequate supervision and checking of the 
work of assistants and a scrutiny by the experimenter of the data from 
every experimental unit will go far towards the discovery and rectifica- 
tion of errors. 

Faulty technique may increase the real experimental errors in two 
ways. It may introduce additional fluctuations of a more or less 
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random nature. Such fluctuations, if they are substantial, should reveal- 
themselves in the estimate of error as calculated in the analysis of 
variance (chapter 3). Where his estimated standard errors are con- 
sistently higher than those of other workers with similar material, the 
experimenter is advised to seek the reason, which may lie in differences 
in technique. In addition, faulty technique may result in measurements 
that are consistently biased. The estimate of error does not take ac- 
count of such biases, since it is derived from comparisons of the measure-_ 
ments with one another; in other words, it estimates precision rather 
than accuracy. The principal safeguards against such biases are care 
and skill in the construction and handling of measuring devices, plus the 
intelligent use of randomization. 

It is worth while to consider from time to time whether simplifications 
can be brought into the technique without undue loss of accuracy, 
Many experiments involve chemical determinations which vary little 
from unit to unit within the same treatment. For instance, in sugar- 
beet experiments, the experimental error of the amount of sugar per 
acre is due mainly to the error in the weight of roots per plot. The 
amount contributed by the sugar content percent is nearly negligible. 

With compound measurements of this type, the labor devoted to the 
less variable component can be reduced with little loss of accuracy. If 
the chemical measurement is expensive, a considerable saving in cost 
may be made. Tests of significance of the chemical composition need 
not be sacrificed. For example, if the experiment has six replicates, а 
chemical analysis might be made for each treatment on bulked material 
from the first three replicates and another on bulked material from the 
last three replicates. These data provide a valid test of significance based 
on two "replicates." 


2.34 Selection of Experimental Material. The choice of the experi- 
mental unit may be of importance. In the planning of field experiments. 
numerous studies have been made of the variability among crop yields 
on plots of various sizes and Shapes under uniform treatment. From. 
these data the best size and shape are selected. The criterion should 
bato obtain the maximum accuracy for a given expenditure of time and 
abor. 

Frequently, uniform material is prepared specially for experimental 
purposes, as in the development of inbred lines by laboratories engaged 
in animal experimentation. Alternatively, the experiment may be con- | 
fined to a sample chosen for its homogeneity from a large batch of ex- 
perimental material. If the results of the experiment are to be applied 
to unselected material, these types of specialization have potential dis- 
advantages. Responses obtained to the treatments in highly selected 
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material may not be the same as the responses in unselected material. 

Selectivity in the material is difficult to avoid where experiments are 
conducted in the field of economies or sociology. In testing some method 
of farm management, for instance, the experiment may require the active 
participation of a group of farmers. The success of such experiments 
depends greatly on the tact and resourcefulness of the investigator in 
persuading farmers to cooperate so that the participants are a represent- 
ative sample of the population about which generalizations are to be 
drawn. 


2.35 Additional Measurements. In the course of an experiment it 
may be possible to take supplementary measurements which predict at 
least to some extent the performances of the experimental units. In an 
experiment to measure the effects of different diets on the weights of 
children, their weights at the start of the experiment would be a supple- 
mentary measurement of this kind, since the increase in weight of a 
child during the experiment is probably correlated with his initial 
weight. 

By a technique known as the analysis of covariance, we may estimate 
from the data the extent to which the observations were influenced by 
the variations in these supplementary measurements. The average 
response to each treatment can then be adjusted so as to remove the 
experimental error arising from this source. In the school experiment, 
the adjusted responses to treatments represent approximately the re- 
sponses that would be obtained if all the children had the same initial 
weight. Thus the effects of variations in the initial weights are largely 
eliminated from the experimental error, without the necessity of equal- 
izing the initial weights in the planning of the experiment. 

Since the analysis is a statistical technique of comparatively recent 
origin (it was first presented by Fisher about 1932), it may be unfamiliar 
to many investigators who could use it to obtain a substantial increase 
in precision with little effort. Its purpose is to remove experimental 
errors arising from extraneous sources of variation which it is impractical 
or too costly to control by a more refined technique. Cases are fairly 
common where the use of covariance has more than doubled the effi- 
ciency of the experiment. The extra work involved consists in taking 
the subsidiary measurements and in applying the adjustments to the 
treatment means. An example of the computations is given in section 
3.8. 


2.36 Planned Grouping: Complete Blocks. Finally, we may attempt 
to minimize the experimental errors through the choice of the experi- 
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mental plan. The prospects of increased accuracy by this means have 
been widely explored during the past thirty years. This book is in- 
tended as a source of reference to the numerous procedures that have 
been developed. 

The basic idea is simple. Consider an experiment of which a number 
of separate replications have been conducted. The experimental errors 


of the results from any replicate can arise only from sources of variation 
that affect the units within that replicate. Consequently, the error of 
the difference between two treatment means taken over a number of 
replicates must also arise solely from variations within the individual 
replications. Variations from one replicate to another do not contribute 
to the errors. In carrying out an experiment we utilize this simple fact 


by trying to control sources of variation that affect different units in the 
same replicate. We need not attempt to reduce differences among the 
replicates. For example, if the experimental units form a very heter- 
ogeneous batch, we attempt to group them so that units in the same 
replicate are similar; we do not worry if the units in one replicate are 
not at all similar to those in another. By this device, precise experi- 
ments can often be made from what at first sight appears an unpromising 
batch of material. Similarly, if a uniform experimental technique cannot 
be maintained throughout the experiment, the important point is to 
keep the technique uniform within a replication; changes should be 
made when moving from one replicate to another. This type of design, 
known in agriculture as randomized blocks, is discussed in section 4.2. 

The idea is carried a stage further in the latin square (section 4.3). 
The treatments are arranged diagrammatically so that each appears once 
in every row and once in every column of a square array. Variations 
among the groups of experimental units which correspond to the rows 
and also among those which correspond to the columns are eliminated 
from the experimental errors. For example, in a comparison of 5 dif- 
ferent ways of performing some manual operation, the rows may repre- 
sent 5 operators, each of whom carries out all the methods in turn. The 
columns, numbered from left to right, may prescribe the order in which 
each operator performs the 5 tasks, The plan of a latin square design 
for this experiment might read as follows. 


Order of procedure 
Operator (1) (2) (3) (4) (5) 
1 D A B E с 
П А В С р Е 
ш В Е А с р 
iv с р Е В А 
ү Е е р А в 
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The letters A, B, С, D, E represent the 5 methods. Thus operator I 
carries out method D first, method A next, and so on. With this ar- 
rangement, the experimental errors are unaffected by differences among 
the abilities of the men and also by systematic variations introduced by 
the order in which the methods are performed. If the opportunity 
presents itself, this type of control may include a third factor by the use 
of a graeco-latin square (section 4.5). 


2.37 Planned Grouping: Incomplete Blocks. In the randomized block 
design, the precision depends to some extent on the number of treat- 
ments. As this number increases, it becomes more difficult to keep the 
variations between experimental units within each replicate small, and 
the error variance per unit tends to increase. In experiments with a 
large number of treatments, numerous attempts have been made to 
avoid this loss of precision by the formation of groups of experimental 
units which do not contain all the treatments and therefore can remain 
small, These groups are called incomplete blocks. The groups are so 
constructed that we can still remove the effect of the differences among 
groups from the experimental errors, just as such differences are auto- 
matically removed in the randomized blocks design. With incomplete 
blocks, however, this removal usually requires the application of adjust- 
ments to the treatment means, so that the statistical analysis of the re- 
sults becomes more complex. 

The best method of constructing the incomplete blocks varies with the 
nature of the experiment. Sometimes certain comparisons among treat- 
ments are of greater interest than others. For instance, in a field experi- 
ment containing a number of early-maturing and an equal number of 
late-maturing varieties of a crop, comparisons between pairs of varieties 
which mature at about the same date will presumably be more important 
than comparisons between an early and a late variety. An incomplete 
block might contain either all the early or all the late varieties, a pair of 
blocks forming a complete replication. This arrangement, of which 
several variations are possible, is known as the split-plot design (section 
ТЫН. 

A similar situation occurs in factorial experiments. Certain com- 
parisons, e.g., the average effects produced by varying one factor, are 
nearly always of immediate interest. Others, which represent rather 
complex relationships among the effects of different factors, may be of 
minor importance, often because previous experiments have shown that. 
these interrelationships, called interactions, exert little or no influence. 
It has been found that, by a deliberate sacrifice of precision in the 
estimates of these interactions, the size of the group or block can be re- 
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duced so as to produce an increase іп (һе precision with which the average 
effects of the factors are estimated. This principle, known as con- 
founding, can also be combined with the use of the latin square (chapters 
6 and 8). 

If all comparisons between pairs of treatments are potentially of equal 
importance, a different method is used in forming the blocks. When 
treatments are arranged in incomplete blocks, two treatments which oc- 
cur in the same block are more precisely compared than two which are 
placed in different blocks, at least in so far as the investigator has sue- 
ceeded in decreasing the variation within blocks. Therefore the goal is to 
construct a design such that any pair of treatments occurs equally often 
within some block. This type of problem has been investigated pre- 
viously for its 1aathematical interest. A solution can be found for any 
number of treatments and any size of block, but most of the solutions 


7%. require too many replications for the usual conditions of experimentation. 


The solutions available to date (called balanced incomplete block designs 
(chapter 11)) are given for all cases in which the number of replications 
does not exceed ten. Here again the latin square principle can some- 
times be used (chapter 13) to eliminate variation amongst two different 
types of grouping of the experimental units. 

For a given number of treatments and a given size of incomplete 
block, balanced designs allow little choice in the number of replications. 
Thus, with 64 treatments and blocks of 8 units each, 9 replications are 
required to balance the design. In order to provide designs for small 
numbers of replications, a number of additional types have been de- 
veloped; these are similar to balanced designs except that they lack 
complete symmetry. 

, One group, called lattice designs (chapter 10), has been used widely 
in agricultural experiments during the past 15 years. For these designs, 
the number of treatments must be an exact square, e.g., 25 or 49, while 
the number of units in the block is the corresponding square root, 5 or 
7, respectively. A further group, known as cubic lattices (section 10.4), 
is useful when the number of treatments exceeds 100 yet it is desirable 
to have a small block. In this case the number of treatments is the cube 


of the number of units per block, and the number of replications may be 
any multiple of 3. 


2.38 Summary. There are numerous methods available for increasing 
the accuracy of an experiment. Frequently the same end may be 
reached in several different ways. Thus, in the experiment which tests 
the effects of a number of diets on the weights of children, the disturbing 
effects of variations in their initial weights can be reduced by selecting 
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for the experiment a number of children of approximately the same initial 
weight, or by the use of the analysis of covariance, or by a planned 
grouping in which children within the same replicate are of approxi- 
mately equal initial weight, or simply by having a large number of 
replications. The method adopted should be that for which the desired 
standard of accuracy can be attained with the smallest expenditure of 
time and effort. There is no special merit in either a complicated ex- 
perimental plan or a highly refined technique if equally accurate results 
can be secured with less effort in some other way, A good working rule 
is to use the simplest experimental design that meets the needs of the 
occasion. This is not to say that the more complex designs will be used 
only rarely; in fact, they have already demonstrated their utility in 
fields of research where no other equally practicable method appears to 
exist for reaching the same results. 


2.4 The Grouping of Experimental Units 


2.41 Investigation of Methods for Grouping. In order to take full 
advantage of the opportunities for increased precision by suitable group- 
ing of the units, the investigator must know the best criteria for group- 
ing. Frequently these are suggested from previous observations on the 
experimental material or on disturbing factors that become evident in 
the course of an experiment. Where knowledge is less definite, as in a 
new line of research, there are two ways in which information may be 
accumulated. 

One is to devote an experiment specifically to this problem. In this 
case it is usually advisable to subject all units to the same treatment, 
carrying out a so-called uniformity trial. The word uniformity refers to 
uniformity of treatment; the experimental material and technique should 
be representative of those which are used in actual experiments. The 
amount of material should be sufficient to enable the experimenter to 
superimpose on the results a hypothetical trial of the size which he cus- 
tomarily performs. It is worth while to record any auxiliary measure- 
ments which might predict the performance of the experimental units. 
From the results of these trials any proposed grouping of the units can 
be formed and the amount of variation within and among the groups 
can be calculated. If it is desired to compare different experimental 
plans, each may be superimposed on the results of the uniformity trial. 
By the method indicated in section 2,31, the investigator may estimate 
the relative numbers of replications that are needed to reach the same 
degree of precision with the various plans. 

Secondly, useful information can sometimes be secured without any 
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special trial or technique by examination of the results of actual етрегі- 
menis. With an experiment which is suitably designed, it is possible to 
estimate from an analysis of variance of the results what the error vari- 
ance would have been if any particular grouping had not been used. This 
estimate, being derived from the same data, is directly comparable with 
the error variance obtained by the use of the grouping and enables the 
investigator to evaluate the success of the grouping. Consequently, for 
experiments in a new line of research, the experimenter may try any 
method of grouping that might be effective, in the knowledge that its 
appropriateness will be revealed by the experiments themselves. In- 
structions for performing such comparisons, which are easily made when 
the analysis of variance has been completed, are given in the notes which 
accompany each type of design. 


2.42 Criteria for Grouping. Many factors have been used as the basis 
for grouping. In agricultural experimentation it has been repeatedly | 
demonstrated that plots close together tend to be more similar in their | 
yields than plots farther apart. The almost universal practice is to put 
neighboring plots in the same group, with the groups approximately 
square in shape whenever practicable. Contiguity frequently forms a 
good basis for grouping in greenhouse experiments also, where differences 
may exist along the bench in temperature, sunlight, air currents, or ac- 
cessibility for watering. Alternatively, in many plant experiments, 
and more particularly in animal experiments, some characteristic of the | 
plant or animal is much more useful, the physical location of the ex- 
perimental units throughout the course of the trial having a relatively 
minor influence on the results. Age, weight, vigor, sex, and genetic con- 
stitution are some of the factors most commonly used. The animal it- | | 
self may constitute the block in cases where the treatments can be applied | 
in succession to the same animal without producing residual effects which | 
obscure the results, 

Grouping is helpful in coping with the effects of time trends in experi- 
ments that extend over a number of work periods. It is often found, in 
both the biological and the industrial laboratory, that the level of yield 
shifts slightly from day to day and, to a greater extent, from week to 
week and from month to month. If, for example, five days are required 
to complete the work of applying the treatments in an experiment that 
has five replicates, the wise procedure is to complete a single replicate 
on each day. In this way any changes in the level of yield that oceur 
from day to day will not influence comparisons made among the treat- 
ments. The alternative method of applying all replicates of treatment 1 
on the first day, treatment 2 on the second day, and so on, is an open 
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invitation to bias, since any day-to-day shifts in level will show up as 
treatment effects. 

A single grouping, as in the randomized block design, can be used to 
eliminate simultaneously variation from a number of different sources. 
For example, suppose that an experiment were planned to investigate 
whether some treatment applied to rats enabled the animals to with- 
stand subjection to a dose of a poisonous gas. The replication would 
contain two animals, a treated rat and an untreated rat to be included 
as a control Given a sufficient stock of experimental animals, the pairs 
in any replication might be animals of the same litter (and consequently 
of the same age), of the same sex, and of approximately the same weight 
and vigor. If it were convenient to test two animals at the same time, 
the pair could be put into the gas chamber together, thus eliminating 
also the effects of variations in the strength of the dose or in the time for 
which it was supplied. 

The freedom with which the size of block can be varied depends on 
the type of research. In field experiments the block can be laid down so 
as to accommodate any number of treatments, though, as previously 
stated, the standard error per plot may be expected to increase slowly as 
the size of block increases. When the use of designs with incomplete 
blocks is contemplated for field experiments, the principal question is to 
decide for what number of treatments the reduction in block size be- 
comes worth while. On the other hand, the number of leaves оп a plant, 
or of animals of the same litter, usually varies within rather narrow 
limits. In such cases, where the most homogeneous grouping fixes the 
size of the block, incomplete block designs may be of special interest. 

The gains in efficiency obtainable by skillful construction of the 
blocks, when measured in terms of increased replication, are often large; 
examples can be cited in which the effective replication has been in- 
creased from two- to tenfold. Frequently, apart from the preliminary 
care and thought involved, an accurate design uses the same material 
and involves the same physical operations as the design which it replaced. 
In some of the more complex designs the computations required to es- 
timate the treatment averages and to perform tests of significance be- 
come rather laborious and involved. The experimenter's reaction to 
these difficulties will depend on his aptitude for numerical computations. 
At a first trial considerable time may be spent in mastering some of the 
methods, but the processes become easier with familiarity. 
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СОН АРТ ЕК, 3 


Notes оп the Statistical 
Analysis of the Results 


3.1 Introduction 


With regard to the statistical analysis for a design, the procedure 
adopted in this book is to present the computations, with worked nu- 
merical examples for the designs likely to be most frequently used. To 
supply a complete mathematical justification of the analysis for each 
type of experiment, although in many respects desirable, would have 
unduly increased the amount of material. The same general technique 
governs the analysis of all designs, differences arising only іп the adapta- 
tion of the technique to the particular structure of a design. In the 
present chapter this technique—the method of least squares—is de- 
scribed and applied to one of the simplest designs. With perseverance 
the method may be used to verify the computing instructions for other 
designs in this book. It should be pointed out that the computing meth- 
ods that are easiest in practice are not necessarily those that flow from a 
straightforward application of the theory, so that at first sight some of 
the instructions may appear different from those given by theory. 

Some account will also be given of certain statistical techniques, such 
as the technique for missing values and the analysis of covariance, that 
are particularly useful in handling the results of experiments, though 
they are perhaps too specialized to be included as part of a general in- 
troductory course in statistics. 


3.2 The General Method of Analysis 


3.21 The Mathematical Model. The illustrative data come from an 
experiment conducted in 1935 by the Rothamsted Experimental Station 
(3.1). The object of the experiment was to measure the effectiveness of 
4 soil fumigants in keeping down the numbers of eelworms in the soil. 
The fumigants were chlorodinitrobenzene (CN), carbon disulphide jelly 
(CS), and two proprietary preparations, *Cymag" (CM) and “Seekay” 
(CK). Each fumigant was tested both in a single and a double dose. 
These comprize 8 distinct treatments. The control (no fumigant) was 
45 
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the ninth treatment. Normally the replication would have consisted 
of 9 plots, but in this case 4 control plots were placed in each “replicate,” 
which actually contained 12 plots. The purpose was to supply a fairly 
accurate standard against which to measure the performance of the 
fumigants. 

The experiment was laid out in 4 separate blocks of 12 plots each, so 
that there were 4 replications for each dose of each fumigant and 16 
replications of the control. Within a block the 12 “treatments” were 
allotted to individual plots at random. The experiment is of the 
familiar type known in agriculture as randomized blocks (see chapter 4. 
Тһе fumigants were ploughed in during spring, after which a crop of oats 
was sown. After harvest, a sample of 400 grams of soil was taken from 
each plot and the number of eelworm cysts counted for each sample. 
The data are those labelled “second count” in table 3.1. 


TABLE 3.1 PLAN AND NUMBERS OF EELWORM CYSTS PER 400 GRAMS OF 
SOIL, FIRST COUNT ABOVE, SECOND BELOW 


Block 1 Block 2 | 
Totals Totals 
2587 964 
4383 3075 
Block 3 Block 4 
Totals Totals 
1743 872 
4752 2470 


In a formal analysis of the results the first step is to set up ап eq 
tion for every observation. This equation expresses the observation | 
the sum of four components: (i) a general average about which 
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observations are presumed to be fluctuating; (ii) a component repre- 
senting the effect of the treatment applied; (iii) a component represent- 
ing certain environmental effects which the design of the experiment 
enables us to isolate; and (iv) a residual component, representing all 
other sources that influence the observation, and generally referred to as 
the "experimental error." 

Component (iii) cannot be fully appreciated without some knowledge 
of the technique employed in the construction of designs. This technique 
consists in arranging the conduct of the experiment so that, in the sub- 
sequent analysis of the results, differences among certain groups of ob- 
servations can be measured. What is more important, the effects of such 
differences can be eliminated from the estimates of the treatment effects, 
If this elimination were not feasible, these differences would contribute 
to the “experimental error,” component (iv), with the consequence that 
less accurate estimates of the treatment effects would be obtained. In 
the present example, as will be seen, these differences are differences 
among the groups of 12 observations that constitute a block. With 
each design that we describe, the environmental components that are 
eliminated in this way are pointed out. 

We proceed to write down the mathematical model as applied to the 
example. Instead of numbering the observations from 1 to 48, it is con- 
venient to use a subscript % to denote the treatment applied, j to denote 
the block in which the observation lies, and k to denote the order within 
the block. For the plots that receive fumigants, the subscripts $ and 7 
are sufficient to define each plot uniquely. The subscript Ё is needed to 
distinguish between different control plots within the same block. In 
this notation the equation for any observation may be written 


уйк = и + Ti + Bj + ei (3.1) 
where и represents the general mean, т; the effect of the treatment, 6; 
that of the block, and еу. the experimental error. 


3.22 Assumptions Made in the Model. There is already implicit іп 
the model one assumption that may not be true in the data obtained 
from experiments, This assumption is that the treatment effect and the 
block effect are additive. If we take the difference between two observa- 
tions in the same block, say уд, and у», we have, аз a deduction from 
the model, 

Vi — Ул, = 71 — T2 + еры — Фа (3.2) 
The block effects have disappeared from the right-hand side; that is, 
the model implies that the difference between the true effects of two 
treatments is the same in all blocks. On a block with a high eelworm 
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infestation a successful fumigant is presumed to reduce the eelworm 
numbers by the same amount as on a block with a low eelworm infesta- 
tion. This is unlikely to be so, except as a first approximation. Little 
systematic study of the accuracy of this approximation in practice has 
been made. From general experience it appears that the approximation 
works well in a large number of experiments; nevertheless, cases are not 
infrequent where the approximation is poor. 

In addition, several assumptions are made about the residual effects 
ejr. These are taken to be independent from observation to observation, 
and to be distributed with zero mean and the same variance c^. Further, 
for tests of significance and the estimation of confidence limits, the 6% 
are assumed to follow the normal or Gaussian frequency distribution. 
In practice, these assumptions are only approximately fulfilled. The 
consequences of errors in the assumptions will be indicated later (section 
3.9). 


Apart from changes in detail, the type of model and assumptions | 


presented for this example apply to all experimental arrangements in- 
eluded in this book, except for two types of design for which a more 
complex model is required. These are the split-plot designs (chapter 7), 
where two or more different error variances are postulated, and in- 
complete block designs (chapter 9), where additional assumptions are made 
about the nature of the environmental effects 8j. 


3.23 Estimation of the Treatment Effects. When the assumptions in 
the previous section are valid, there is а well-known result in statistical 
theory that the best estimates of the unknowns р, 7; and В; aro obtained 
by the method of least squares. This method chooses estimates m, li, and 
bj, respectively, which minimize the sum of squares of the residuals 

У (ise m — t — bj? (3.3) 
taken over all observations. 

In order to find these minimizing values we differentiate the sum of 
squares with respect to each unknown in turn, and set the derivative 
equal to zero. Consider a specific unknown, e.g., t 1f any square con- 
tains t; its derivative is 


—2(угь— m — t; — 5j) 
If the square does not contain t; the derivative is zero. Hence, if the 


derivative of the sum of squares in (3.3) is set equal to zero, the resulting | 
equation may be written 


DX Wie -m ~t; — bj) =0 


E (m+ t +L) =Dd vie 6A 


or 
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where the sum is over all observations whose equations contain ¢;, or 
more generally over all observations whose equations contain the param- 
eter to be estimated. 

Equations (3.4) are called the equations of estimation, or the normal 
equations. As we have seen, the equation of estimation for any param- 
eter is constructed by equating the total of the observed values у; to 
the total of the expected values (m + t; + bj) over all observations whose 
equation contains the parameter. 

We now write down some normal equations for the example. Since 
m occurs in all 48 observations, its normal equation is 


48m + 4(l + te +--- №) + 16% + 12 + +... 44) =G (8.5) 


where tg denotes the control treatment, and G the grand total of all 48 
observations. At this point a simplification may be introduced. In 
the model the quantities t; are needed only to indicate by how much an 
individual treatment differs from the average of all treatments in the 
experiment. Accordingly the model is not changed in any essential 
feature if we assume that the mean of all the ¢; is zero. It may also be 
shown that any convenient linear function of the & may be assumed to 
be zero. The same remarks apply to the block values b;. From the form 
of equation (3.5) it appears that convenient assumptions of this kind are 


ti +t tet tet Als =0 (3.6) 
b; + be + by + by = 0 (3.7) 


With these assumptions equation (3.5) reduces to 48m = G, so that 
m becomes the mean value of all observations in the experiment. 

By summing over the observations that receive a particular treat- 
ment t; we find that its normal equation is . 


4m + 4t; + by + bz Fbs +, = T; (i= 1,2, +++, 8) 
and for the control, 
16m + 16t + 40, + ba + ba + b4) = Ts 


where the 7’s denote observed treatment totals. From relation (3.7), 
these equations reduce to 
T; Тә 
homm — i= 1,2, ---, 8); =——т 3.8 
bry ШИЙ @ = 1,2, Ji Шлі Tg (3.8) 
As might be expected, the effect of any treatment is estimated by 
taking the difference between the mean of the observations which re- 
ceive that treatment and the general mean. 
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Let us express these estimates in terms of the original model (3.1). 
By adding the relevant equations we find 


T; = Ap + 4r; + (в: + Bo + Bs + Bs) + Хей» 
То = 1би + 16% + 4(8: + Ba + Bs + Ba) + Хе 


Hence, using (3.8), 
m +t; =u + т; + $8 + 82 + Ba + Bs) + ё; 


where ё; is the average of the residual errors on plots that receive treat- 
ment i. For the difference between a pair of treatments we deduce 


i; — ty = (Ti — Tu) + (& — ё) (3.9) 


This shows that the estimated difference equals the true difference plus 
an error of estimate (6; — ёи). Note that the error of estimate does not 
contain any of the environmental (block) effects arising from compo- 
nent (ii). This result exemplifies a valuable property of least squares 
solutions, namely, that the estimate of the difference between the ef- 
fects of any two treatments is not influenced by the environmental ef- 
fects, but only by the residual effects еу. It is in this sense that we say 
that environmental effects are eliminated from the estimates of the treat- 
ment effects. 

Estimates of the block or environmental effects are usually of less 
interest. It may be verified that the least squares estimate of 8; is 


bj = 3 -m 
where B; is the observed block total. 


3.24 Tests of Significance and Confidence Limits. When the estimates 
of the treatment effects have been obtained, the details of the subsequent 
study of them vary with the type of experiment. Usually, however, 
some of the following procedures form the basis for the final conclusions: 
(i) a test of significance of the null hypothesis that the effects т; and 
Tu of two treatments are identical, or, more generally, of the hypothesis 
that some linear combination of the 7; is zero; (ii) the construction of 
confidence limits for the difference (т; — т„) between the effects of two 
treatments, or for a linear function of the treatment effects; (iii) a test 
of significance of the null hypothesis that a group of treatments Ті» 
то, ** 5, тр all have identical effects. Іп this section the theory govern- 
ing these procedures is presented briefly, without proof. The arith- 
metical methods are described in the next section. 
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2. Test of significance of the difference between two treatment effects. 
From equation (3.9) we have 


ti — ty = (ri — Tu) + G ё) (3.9) 


Since the e’s are normally distributed with zero means, it follows by 
theory that (t; — 1) is normally distributed about the true difference 
(т; — ти). Further, the variance of (t; — tu) may be shown to be 


1 1 
E 
Ti Tu 


where r; and r, are the numbers of replications of t; and tu, respectively. 
Since it is useful to be able to calculate the variance of any comparison 
in which we are interested, the rules for doing this are given in section 
3.5. 

In order to proceed we need an estimate of the experimental error 
variance с?. From theory, the best estimate is known to be 


ac уба т 0—63) 
Ne 
where the residual sum of squares is taken over all observations. The 
divisor ne, called the number of degrees of freedom (d.f.) in the estimated 
error, is given by the rule 
пе = (Total number of observations) — (number of independent 

parameters that were estimated) 
The word independent is introduced because we nearly always include 
in the original model more parameters than are strictly necessary. In 
the example we found that we could assume one linear relation (3.6) 
among the 28 and one (3.7) among the b’s. Consequently the number of 
independent parameters is 

1 (for т) + 8 (for the 28) + 3 (for the b’s) = 12 
Since there are 48 observations, n, is 36. A 

Hence, if the null hypothesis is true, i.e., 7; = Tu, then (t; — ty) is 

normally distributed with mean zero and estimated variance 


The ratio 
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is known to follow Student’s t-distribution with ne, or 36 d.f. This is the 
quantity used for a test of significance of the null hypothesis. 

With a more complicated function of the estimated treatment effects, 
say L = (wit; + wate +- - -+ wil), where the w's are any set of num- 
bers, the procedure is essentially the same. The estimated variance of 
L is ӘУ, where f is a numerical factor found by the rules given in section 
3.5. Fora test of the null hypothesis that the true value of L is zero, we 
use the ratio L/sV/f, which is distributed as Student's ¢ with n, degrees 
of freedom. 

di. The construction of confidence limits. The theory for a test of sig- 
nificance also leads to confidence limits for the unknown true difference 
(т; — ти). For, when т; and т, are not necessarily equal, the quantity 


(ti — tu) — (ri — ти) 


n 1 
NM 
т Tu 


may be shown to follow Student's t-distribution with n, degrees of free- 
dom. Hence the confidence limits are given by 


(т; — ти) = (в — tu) + sd(n,, а) 5. 


where d(n,, а) is the value of Student's t corresponding to n, degrees of 
freedom and the chosen confidence probability a. A similar method is 
used for the more general function L. 

iit. A test of the identity of a group of treatment effects. Quite frequently 
the first test to be made is that of the hypothesis that ull Ё treatments 
have produced identical effects, 1.е., that all т; are equal. In other cases 
we are more interested in the hypothesis that some subgroup of the treat- 
ments, say treatments 1 to p, has produced the same effects. We con- 
sider the latter test, since it reduces to the first test if p is put equal to К. 

This test is provided by a general theorem that has many applications. 
First, we find the least squares estimates of all treatment and environ- 
mental effects in the usual way, and compute the residual sum of squares 


S? = У (Wir — m — t; — bj)? 


Next we start again and rewrite the mathematical model, inserting the 
restriction that тү = ту =... = тр. That is, wherever any of these 
т; appears in the equation for an observation, we replace the т; by a 
common symbol, say 7'. The least squares estimates are computed for 
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this new set of equations, using the same observations. Again we com- 
pute the residual sum of squares which may be denoted by 


S? = D vise т —t' — fy 
This will always be found to be at least as large as 817. Finally, the 
theorem states that, if the null hypothesis is true, the quantity 


S?-S? S? 
2:1 Ne 


follows Snedecor’s F-distribution with (р — 1) and п. degrees of freedom. 

Although the procedure may appear rather complex, the test criterion 
used is a reasonable опе. 517 is the sum of squares of deviations of the 
observations from the values predicted for them by the original model, 
so that it measures how closely the original model agrees with the data. 
52° plays the same role for the restricted model. Consequently, if 92 is 
much larger than 512, we are inclined to think that the restricted model 
does not fit the data nearly so well as the original model, and therefore 
to reject the null hypothesis that the restricted model is the correct one. 
However, the value of (922 — 512) alone does not provide a measure of the 
improvement in the fit with the original model. If, for instance, this dif- 
ference is 10, then, other things being equal, we should regard the im- 
provement in fit as greater when S;? is 1 than when S;? is 1000. It is this 
type of consideration that leads to the use of the ratio (S2? — 812) /512 
as the essential part of the test criterion. 

Although the theorem postulates two separate sets of least squares 
solutions, nearly all designs are constructed so that only one веб of solu- 
tions must be found in practice. A number of designs which could have 
been included in this book were omitted because the least squares 
analysis seems too cumbersome for frequent use. 


3.25 Тһе Analysis of Variance. All the procedures (t-tests, F-tests, 
and construction of confidence limits) use the residual sum of squares, 
which will often be called the error sum of squares. This quantity could 
be found by calculating for each observation угз the value (m + t; + bj) 
predicted by the least squares solution. The sum of the squares of the 
differences between observed and predicted values could then be ob- 
tained. This method is slow, and the error sum of squares is much more 
quickly computed by a technique known as the analysis of variance. 

In the original model, each observation is represented as the sum of 
four components due respectively to the general mean, the effect of the 
treatment, the environmental effect, and the residual effect. In the same 
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way the analysis of variance partitions the sum of squares of the ob- 
servations into four sums of squares, one attributable to the general 
mean, one to differences between the estimated effects of the treatments, 
one to the environmental effects which the experiment is capable of 
measuring, and lastly one which is the residual or error sum of squares. 
In most cases we compute the original sum of squares and the first three 
components, obtaining the error sum of squares by subtraction. 

The analysis of variance provides much more than a short-cut method 
of securing the error sum of squares. The sum of squares due to treat- 
ments is the quantity (S2? — 512) needed for the F-test of the hypothesis 
that no differences exist between the effects of the treatments. By a 
slight extension the analysis also supplies the sum of squares required 
for testing the equality of the effects of a subgroup of the treatments. 
The component due to environmental effects enables us to estimate by 
how much the accuracy of the experiment has been increased by eliminat- 
ing these effects from the estimates of the treatment means. 

The analysis of variance depends on a number of algebraic relations 
which will be illustrated for the eelworm experiment. From section 3.23 
it will be recalled that the least squares normal equation for any un- 
known, say &, was 


È (Vir — m — t; — bj) =0 
summed over all observations which received the ith treatment. If we 
multiply this equation by t; and add the equations for different treat- 
ments together, we obtain the equation 

Ук — m — ti — b) = 0 (3.10) 
where the sum is now over all observations, since every observation is 


associated with one and only one &. Similarly we may establish the re- 
lations 


È min — m — t; — bj) =0 (3.11) 
Ab — m — ti — bj) =0 (3.12) 
where both sums extend over all observations. 


A few more relations of this type are required. If we add the estimates 
t; over all plots in the jth block, the sum is 


(i Ft tte ++ + 4% 


But this is zero because of equation (3.6) in which a linear relation among 
the ts was introduced. Hence, if we multiply by 5; and add over all 
blocks, we establish the relation 


Хы; = 0 (3.13) 
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where the sum is again over all observations. By similar arguments we 
prove the additional relations 


Хты-0; >) mt; =0 (3.14) 


These relations lead to the partition of the sum of squares of the ob- 
servations. Write 


Yije = m + ti +b; + (и — m — t — bj) 
Square both sides, and add over all observations. The six relations 
(3.10) to (3.14) show that all six sums arising from cross-product terms 


on the right-hand side add to zero. Consequently, we have the following 
analysis of sums of squares. 


Dyin? = т + Vt? Mb .MGa-m-tu-b) (3.15) 


This equation is the basis of the analysis of variance of the results. 


3.26 Application to the Example. To obtain the left-hand side of 
(3.15) in practice, we simply compute the sum of the squares of all ob- 
servations. For the eelworm data this will be found to be 5,481,198. 
The components due to the mean, to treatments, and to blocks are not 
usually calculated from (3.15) as it stands, because it is quicker and more 
accurate to obtain them from ѓоѓаїз rather than from the estimated ef- 
fects. 

Thus, for the mean, the contribution is 48m?, since there are 48 ob- 
servations. If G is the grand total of all observations, 14,680, this соп- 
tribution may be written G?/48, or (14,680)?/48, which amounts to 
4,489,633. This term is sometimes called the correction for the mean. 

Written in full, the sum of squares for treatments is 

4012 + +... ts?) + 164 (3.16) 
But from (3.8), 
T; Т» 


je (i = 1,2, +++, 8); == 


where Т; is the observed treatment total. By substitution, (3.16) be- 
comes 


A — 4m)? + (Ta — Am)? +---+ (Ts — 4т)?] + 3g (То — 16m)? 
When each parenthesis is expanded, we have 
T? +T? + +--+ Т? 


Ta? 
: tus 2m(T, + Ta +-+ Te T + 48m? 
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Now the sum of the observed treatment totals Т; is the grand total б, 
or 48m. Consequently the last two terms above may be amalgamated to 
give ý 
T? +T +-+ T Т, 
Та Ее $7 9 ^ dám (3.17) 
4 16 
Note that the last term is the correction for the mean, already found to 
be 4,489,633. Expression (3.17) is an example of the more general ex- 
pression 


which gives the treatments sum of squares when the ith treatment is 
replicated r; times, C being the correction for the mean. 

То obtain the numerical value we first caleulate the treatment totals 
ав shown in table 3.2. 


TABLE 3.2 TREATMENT TOTALS FROM TABLE 3.1 


Level of 
application CN cs CM CK Totals 
0 5858 5858 
1 1066 928 1431 892 4317 
2 1265 877 . 1241 1122 4505 


The treatments sum of squares is given by 
(1066)? + (1265)? +- - -+ (1122)? к (5858)? 
4 16 


— 4,489,633 = 157,448 


The blocks sum of squares, У) 0,2, can likewise be expressed in terms 
of block totals as follows. 


By? + By? + B? + Be 


=¢ 
12 
(4383)? + (3075)? + (4752)? + (2470)? 
= 5 — 4,489,033 — 289,427 


where the observed block totals have been inserted from table 3.1. The 
divisor 12 is the number of observations per block. 

The complete analysis of variance is shown in table 3.3. Tt is not 
customary to display the correction term for the mean, since this is of no 
particular interest. The item labelled “total” at the foot of the table is 
the original sum of squares, minus the correction for the mean (5,481,198 
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— 4,489,633 = 991,565). This may be shown to be equal to the sum of 
squares of deviations of the observations from their mean; i.e., 


Уи — 48m? = У) (Yije — т)? 


TABLE 3.3 ANALYSIS OF VARIANCE OF THE DATA IN TABLE 3.1 


Degrees of Вит of Mean 

Г freedom squares square 

Source of variation (d.f.) (s.s.) (m.s.) 

Treatments 8 157,448 19,681 

Blocks 3 289,427 96,476 

Error (by subtraction) 36 544,690 15,130 
Total 47 991,565 


The column which has been computed above is, of course, the column 
labelled “sum of squares." The other two columns remain to be ех- 
plained. The "degrees of freedom" associated with any component are 
the number of independent parameters required to describe that com- 
ponent in the model. In the case of treatments, this always equals one 
less than the number of treatments, and similarly for blocks. For the 
total s.s., the number is the number of observations (48) less one repre- 
senting the contribution of the mean. 

The degrees of freedom have two principal uses. First, by subtraction, 
they give the degrees of freedom for error (36). As mentioned in section 
3.24, this is the divisor needed for the error s.s. in order to estimate the 
error variance с?. "Тһе estimate is 544,690/36, or 15,130, which is called 
the error mean square and is shown in the right-hand column of table 3.3. 
Second, the degrees of freedom for treatments are used in an F-test of 
the hypothesis that all К treatments produced the same effects. Іп 
section 3.24 the value of F for testing this hypothesis was given as 


82 3 КАШ З SZ 


Ё = i 
k—1 Пе 


The denominator S;?/n, is the error m.s. In the numerator the quan- 
tity (Sg? — 812) may be shown to be the treatments s.s., while (k — 1) 
is the number of degrees of freedom associated with treatments. Thus 
the numerator is 157,448/8, or 19,681, called the treatments mean square. 
The F-ratio for this experiment is 19,681/15,130, or 1.30, with 8 and 36 
df. 

In the same way the mean square for blocks forms the numerator for 
the F-test of the hypothesis that all block effects are identical. Although 
this test is rather seldom of interest, the mean square for blocks has other 
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uses. By formulae which will be given with the individual designs, this 
mean square leads to an estimate of the error variance that would have 
been obtained if the experiment had not been grouped into blocks. 

With some designs the structure of the analysis of variance is more 
complicated than in the example above. What happens is that the 
treatment and environmental contributions to the sum of squares are 
entangled. If we calculate the treatments s.s., a different result is ob- 
tained according to whether we assume that block effects are present or 
absent, and vice versa for blocks. In such cases the usual procedure is 
to present the blocks s.s. as calculated when treatment effects are ig- 
nored, and the treatments s.s. as calculated when block effects are taken 
into account. These items are sometimes called “blocks, ignoring treat- 
ments” and “‘treatments, eliminating blocks,” respectively. If calcula- 
tions are made in this way, the two terms may still be added so as to 
obtain the error s.s. by subtraction; further, the treatments s.s. is still 
the appropriate one for an F-test of the treatment effects. 

This presentation of the analysis of variance is necessarily very in- 
adequate. For further reading, reference should be made to Fisher 
(3.2, 3.3) and Snedecor (3.4), whose discussions involve little math- 
ematics. For presentation of points of theory, see Kendall’s book (3.5) 
and papers by Fisher (3.6, 3.7) and Yates (3.8). Unfortunately, no single 
reference contains a really comprehensive account of the subject. 


3.3 Accuracy in Computations 


3.31 Original Records. If there exists a series of rules which will 
guarantee that all computations are accurately done, we do not know 
them. The following notes may help in maintaining a high standard of 
accuracy. 

The first place where errors may occur is in the original records, These 
should be made in a clearly legible and permanent form. Where scale 
readings are being taken, the habit of checking each reading immediately 
after it has been written down is a useful one. As the observations are 
being recorded, any that appear anomalous should be examined and the 
reason sought. Gross errors are often eliminated in this way. If on 
checking such an observation no error or explanation is found, a note to 
this effect should be made. Such notes are helpful in situations where 
the later analysis is done by a different person. Most statisticians who 
analyze other people's experiments have enountered the perplexing 
problem where an observation is so completely out of line that a gross 
error is suspected, yet the original records contain no comment about 
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the observation, and the recorder’s memory of the circumstances in 
which it was taken has faded. 

Copying from the original records to a form suitable for analysis may 
also introduce errors. Since cross-checking is rather a soporific task, mis- 
takes tend to be overlooked even when two persons are used for the 
check. Sometimes foresight can eliminate the need for copying. 


3.32 Checks in the Analysis of Variance. The analysis of variance it- 
self consists of two sorts of operations. First, a number of totals are 
formed. These are nearly all self-checking, for the fact that the treat- 
ment totals and the block totals both add to the grand total may be ac- 
cepted as proof of their correctness. This is not a watertight check, since 
compensating errors in two treatment totals pass undetected. Also it 
does not ensure that treatments and blocks are correctly identified, and 
cases have occurred where treatment totals were mistakenly labelled as 
block totals, and vice versa. But apart from such rare events the sum 
check is as satisfactory as any. 

The second stage, the computation of the various sums of squares, is 
not self-checking and must be checked by recomputation. The most ac- 
curate method is to have two computers carry out the analysis inde- 
pendently, comparing results only when both have finished the task. 
The practice of having one computer “check” the results of the other, 
though more convenient, always seems to produce a certain number of 
errors. This is to be expected, because, when the first calculations are 
nearly always correct, as they should be, it is difficult for the checker to 
avoid the presupposition that they are correct, with resulting mental 
laziness. Recomputation by the same person, though often unavoidable, 
is probably less accurate than the two preceding methods. 

An expert who wishes to do so can make the analysis self-checking. 
For instance, the error s.s. can be computed independently of the other 
sums of squares, sometimes quite easily and sometimes with more diffi- 
culty. If this sum of squares agrees with that found by subtraction, all 
the items in the "sums of squares" column may be regarded as correct, 
apart possibly from errors in labelling. However, with the computa- 
tional methods that we present, checking is required except in a few cases 
specifically noted. 

The steps in the preparation of summary tables (e.g., calculation of 
treatment means from totals) are usually partly but not entirely self- 
checking. For example, the fact that treatment means average to the 
general mean forms a check. Care should be taken to recompute those 
steps that are not self-checked. 
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With the more complex designs, where treatment means have to be 
adjusted in order to climinate the environmental effects, errors have 
arisen because the computing instructions were misunderstood. For 
instance, the adjustments made were ten times as large as they should 
have been. A person thoroughly familiar with the design would have 
known that the adjustments were ridiculously large, but someone new 
to the design may have no idea what size of adjustment it is reasonable 
to expect. This danger with complex designs should not be ignored. It 
can be minimized by careful study of the method of analysis for any 
design that it is proposed to use. 


3.33 Number of Figures to be Retained. Since the primary purpose 
of the analysis of variance is to obtain an accurate value for the error 
m.s., the number of figures which it is worth retaining during the cal- 
culations depends on the size of this figure. This is not known in ad- 
vance, but often a rough idea of the coefficient of variation (ratio of the 
standard error to the mean of the experiment) is available from previous 
experience. For the original data from which the analysis of variance is 
computed, a crude rule which errs on the safe side is as follows. Record 
the original data to 4 significant figures if the coefficient of variation is 
between 0.4 and 4%, to 3 if it is between 4 and 40%, and to 2 if it exceeds 
40%. Let us apply this rule to the eelworm data. Since experiments on 
eelworms are not very common, there is not much basis on which to 
predict the coefficient of variation. Suppose that we guess that it is un- 
likely to be below 20% (it actually turned out to be 40%). Then, for 
safety, data should be recorded to 3 significant figures, Since the average 
eelworm number per plot is around 300, this means that data are re- 
corded to the nearest eelworm, which is as accurately as they could be 
recorded. 

A more precise rule, requiring a little calculation, is that the rounding 
interval should not exceed one-quarter of the standard error per observa- 
tion. To apply this rule to the eelworm data, starting with the same 
premise that the coefficient of variation is 20%, we note that the mean 
per plot in the experiment is about 300. Thus the predicted value of the 
standard error per observation is about 60, so that the rounding interval 
should not exceed 15. It will be quite satisfactory to round each ob- 
servation to the nearest 10 eelworms. The discrepancy from the result 
given by the previous rule illustrates the fact that the first rule tends to 
be too conservative. 

The advisability of deliberately rounding records already taken should 
be left to the judgment of the experimenter. With modern calculating 
machines it may be more expeditious to carry out the computations with 
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a few unnecessary figures than to take time to perform and check the 
rounding. The rules are most likely to be useful where computing de- 
vices are poor or unavailable. 

Іп the analysis of variance itself, it is generally as well to carry the full 
number of figures obtained from the uncorrected sum of squares; e.g., if 
the original data contain one decimal place, the sum of squares and hence 
the analysis of variance will contain two decimal places. This number 
will usually be excessive, but no more time is likely to be wasted in 
writing down the unnecessary digits than in estimating how many digits 
should be retained. 

In the final presentation of results, on the other hand, superfluous 
digits should be strictly avoided. They make the conclusions more 
difficult to grasp, impede rapid mental comparisons, and give some 
readers an erroneous impression of the accuracy of the results. A treat- 
ment mean should be rounded to one-tenth of its estimated standard 
error; that is, if the estimated standard error of the treatment means is 
2.56, the means could be rounded to 0.256 and should be rounded to 0.1, 
1.е., to one decimal place, since this is the nearest rounding interval that 
is convenient. In some experiments (e.g., those of the split-plot type) 
different treatment comparisons have different standard errors. Here 
the rounding interval should be decided from those comparisons that 
have the smallest error. 


3.34 Identification of Data. The practice should be followed of label- 
ling all data, from original records to summary tables, so that another 
person at a later time can tell what the data mean. Much data, ex- 
pensive to collect and potentially valuable for some research, have had 
to be discarded or destroyed because the original collector cannot be 
reached, and new investigators are unable to discover what measure- 
ments were taken and under what circumstances. Sometimes the units 
in which the observations cr summary tables are presented cannot be 
found, or the crop on which a field experiment was carried out, or the 
nature of the experimental treatments. Frequently, data serve their 
most fruitful purpose when some later investigator gathers from many 
places all the material bearing on some question, and it should be a habit 
to facilitate rather than hinder such research. 


3.4 Subdivision of the Sum of Squares for Treatments 


3.41 Reasons for Subdivision. Often an experiment is planned to 
provide the answers to a number of different questions, not necessarily 
connected with one another. The eelworm experiment is a partial 
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though not an ideal example. One way in which the questions asked 
might be framed is as follows. (i) Is soil fumigation effective? Here we 
are interested in the average reduction in eelworm numbers taken over 
all fumigants. (ii) Are there differences in the effectiveness of different, 
fumigants? For this question, comparisons of the reductions due to dif- 
ferent fumigants are relevant. (iii) What is the relative effectiveness of 
single and double doses? Since the simplest reasonable hypothesis is 
that the reduction in eelworm numbers to the double dose is twice that 
to the single dose, we might narrow this question by asking: Is the aver- 
age response to fumigation proportional to the amount of dressing, or in 
other words is the average response curve linear? (iv) Finally, if there is 
some indication of curvature in the responses, we might ask whether the 
amount of curvature is the same for all fumigants. 

With experiments of this type the F-test of the complete treatments 
m.s. is not particularly helpful because it is directed, as it were, at a 
mixture of several diverse questions. By an extension of the analysis of 
variance, we can subdivide the treatments s.s. into s number of com- 
ponents that are more relevant to the individual questions. Moreover, 
an F-test сап be made on the mean square for each component. The 
rules for subdivision are given in succeeding sections. Effective use of 
the device requires long practice applied to a considerable number of ex- 
periments, 


3.42 Subdivision into Single Components. The simplest case, where 
all treatments have the same number r of replications, is considered first. 
The calculations are best made from the treatment totals 7), Examples 
of the sort of quantity that we are interested in studying are 


Түсү шк сын T EE 


Quantities of this type are called linear functions of the T’s. Note that 
the sum of the coefficients of the 7’s is always zero, as it must be if the 
quantity is to represent а comparison among the 775. These ideas may 
be formalized as follows. 

Definition. Any linear function 


Zu = Т + То +--+ БъТЬ 
ін called a comparison among the 7”в if 


lor + hog ++ ++ lon = 0 
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Rule 1. И z, is any comparison among the 775, the quantity 


ci 2 2 2 
pi? where D, = r(La? + lus? 4-4 lok 


isa component of the sum of squares for treatments and represents 1 d.f. 
This rule gives the divisor D, to be used so that the comparison 2, 
may be included as a part of the treatments s.s. in the analysis of vari- 
ance. Incidentally, the rule implies that z,?/D, cannot exceed the treat- 
ments s.s. If it is found to do so, this is a sure sign of a mistake in com- 
putation. 
Definition. Two comparisons 21 and 22 are said to be orthogonal if 


lily іі +- +++ р = 0 


This “sum of products" relation is very important. By theory, it 
ensures that 21 and z are distributed independently of each other. Its 
relevance to the partition of the treatments s.s. appears in rule 2. 

Rule 2. If гү and zg are orthogonal, then 2;?/D» is a component of 

212 
Treatments s.s. — — 
Di 
This means that, if we divide the treatments s.s. into the contribution 
from a comparison z; and the remainder, and now wish to subdivide the 
remainder, we must choose comparisons that are orthogonal to 21. Sim- 
ilarly, after removing the contribution of 22, the next comparison #3 must 
be orthogonal to both 21 and 22, and so on. This leads to the next rule. 

Rule 3. И the comparisons zi, 22, +++, zy .1 are mutually orthogonal, 
1.е., every pair is orthogonal, then 

Treatments S A a 
n а Big. же — dE er 
eatments s.s Di’ Ds Dian 

This algebraic identity partitions the treatmentss.s. [which has (k — 1) 
degrees of freedom] into (k — 1) components, each representing a single 
degree of freedom. 

Starting with a specified 21, we can always find 22,23, * **, zx .1 80 as to 
construct a complete orthogonal set. In fact, it may be proved that there 
is considerable freedom of choice in this process. When all but the final 
2—1 have been selected, there is only one possible choice for 2,. 1, but at 
any previous stage many 28 can be found that are orthogonal to all pre- 
ceding z’s. The experimenter must choose the z’s that are most relevant 
for purposes of interpretation. 
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Table 3.4 shows the single components most frequently used for inter- 
pretative purposes when there are three or four treatments. It is as- 


TABLE 3.4 SAMPLE SETS OF SINGLE COMPONENTS 


Three treatments 
i, Equally spaced increments of one ii. Two qualities of an ingredient and 
ingredient a control 
д z т ET 

T: -1 1 T, (0) -2 0 

Т» 0 -2 T. (а) 1 -1 

Тз 1 1 Тз (ш) 1 1 

Component Lin. Quad. Effect Quality 

divisor D — 2r* 6r Component of a diff. 
divisor D 6r 2r 


Four treatments 


i, Equally spaced increments of one йі. Two comparable types of ingredient, 


ingredient а and b, and two qualities of each type 

an E? 23 21 E 23 

T; -3 + —1 Ті (а) +1 +1 0 

Т» -1 -1 +3 Та (а) +1 —1 0 

Тз +1 -і —3 Тз (b) -1 0 T1 

Т, +3 +1 +1 Ті (ы) -1 0 -1 
Component Lin. Quad. Cubic Quality Quality 

D 20r 4r 20r diff. diff. 
Componenta vs.b withina within b 

D 4r 2r 2r 


lii. Different levels of each of two different ingredients a and b 


21 E 23 
Ті (аһ) = -1 +1 
Та (аЬ) +1 —1 -i 
Та (aiba) —1 +1 _ї 
Ті (azb) +1 +1 FT 


Average Average 
response response 
Component toa tob Interaction 
D 4r 4r 4r 


н r* = number of replicates, assumed the same for all treatments. The coefficients 
given above are not valid if r varies from treatment to treatment. 


sumed that all treatments have the same number of replicates. The 
divisors D are those required for inserting the square of z in the analysis 
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ot variance. The reader should verify that these divisors satisfy rule 1, 
and that each веб of 278 forms a complete orthogonal set. 

With three treatments the two sets presented are essentially the same, 
though the meaning attached to each z is different. Case i applies in an 
experiment where we һауе, say, zero, single, and double dressings of a 
fumigant, and wish to isolate the linear and quadratic components of the 
response curve. The response found by fitting a straight line to the three 
levels is represented by 21, while 22 measures the deviation from a linear 
response. Case ii arises when we have comparable dressings of two dif- 
ferent fumigants, and wish to examine the average effect of fumigation 
(21) and the difference between the fumigants (22). 

With four treatments, case i shows zero, single, double, and triple ap- 
plications of the same ingredient; here the response curve can be divided 
into its linear, quadratie, and cubie components, Alternatively, in case 
ii, we might be testing the wearing qualities of two types of “100% wool" 
suits and two types of suits made of wool-rayon mixtures. Case iii is 
used, for instance, when we examine the effects of two different amounts 
of sugar and two different amounts of vanilla in the preparation of a cake. 
Experiments of this type are discussed more fully in chapter 5. 

When different treatments have differing numbers of replicates, the 
rules are changed slightly. If the ith treatment has r; replicates, we have 
shown that the treatments s.s. is 

T, Te T? (Ti+ Ti 4T 
4 + ... + = 
т T2 Tk Or) 


The changes to be noted are: 
i. A linear function 


Zw = Ш Та + Т +--+ + Т 
is a comparison among the treatment totals 7; if 
т + тәгә +++ тык = 0 
ii. The divisor required for 2,7 is 
D, = ты? + rolo dd ndo 
iii. Two comparisons 21 and 22 are orthogonal if 


т\л + 7222 +--+ тїз = 0 


3.43 Incomplete Subdivisions. The subdivision need not be com- 
plete in the sense that it is composed entirely of single components. In 
the eelworm experiment we suggested that the treatments s.s., with 8 
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components, might be divided into the following parts: (i) a single com- 
ponent representing the average response to fumigation; (ii) a single 
component representing the deviation of the average response from lin- 
earity; (iii) a part representing differences in the responses to the dif- 
ferent fumigants. Since there are 4 fumigants, this part comprizes 3 
components; (iv) a part representing differences in the curvature of the 
response curves for different fumigants: this also has 3 components. 
Some rules for partial subdivision will be given and applied to this ex- 
ample. The most general rule is as follows. 

Rule 4. А set of g quantities Q; are independent components of the 
treatments s.s., with n; degrees of freedom respectively, if 


(012-9) 


where all 2;; are mutually orthogonal comparisons, and the D;; are the 
appropriate divisors. Further, the remainder 


Treatments s.s. — pil 0; 
is an independent component of the treatments 8.8. with 
[(& — 1) — У nj] degrees of freedom 


This rule, which is a deduction from previous rules, is not very helpful 
to the beginner. With experience it often becomes easy, without carry- 
ing out the details, to see whether a set of Q; can be expressed in this 
way. Some more specialized rules follow. 

ae 4a. If Ty, Тз, +++, Тр are the totals for any set of treatments, 
then 
Tuas DE NCC туу 
n T2 Tp > "i 
isa component of the treatments s.s., with (p — 1) degrees of freedom. 

This rule states that the sum of squares of deviations among any sub- 
group of the treatments, with proper divisors, is part of the treatments 
8.8. 
| Rule 4b. Sometimes the treatments (or part of them) can be divided 
into a number of groups. The number of treatments in a group need not 
be constant, and different treatments may have different amounts of 


Bur ae Let Ту; be the total for the jth treatment in the ith group, 
and let 


Si = Ta + Ti 4 Ты; 
Ri = ra + ri ddr @=1,2,...,0) 
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Then let 


Та? + Та? к= та? _ 8 


DN -1,2,. 

£ та Tiz Ты Ri ы n 
512 S? (Sı ++ Se) 

беа е ЯЗ лын d ( 1 Э) 
Ry Re Ritet Е 


In this case Q; represents the sum of squares of deviations among the 
treatments in the ¿th group, with (p; — 1) degrees of freedom, while 
Qe41 is the sum of squares of deviations among the group totals with 
(0 — 1) degrees of freedom. These Q’s are independent components of 
the treatments s.s. 

Rule 4c. This applies when all treatments have the same number r of 
replicates, and every group contains p treatments. Let 


Zi = Та + Тв +++ Tip 


where the set of Рз remains the same in all groups. Then 


is a component of the treatments s.s., with (g — 1) degrees of freedom. 
This rule is useful when we wish to compare linear functions of the treat- 
ments within each group. Note that the divisor in Q is the same as the 
divisor that would be used for a single 2;. 

Applications. In the eelworm experiment, the average effects of fumi- 
gation are obtained from a comparison of the total for the control with 
the totals for the single and double levels of fumigation. Consequently, 
we might regard the 9 treatments as divided into 3 groups: the control, 
the single levels, and the double levels. Ву rule 45, this gives the follow- 
ing partition of the treatments s.s. (the treatment totals are found in 
table 3.2, p. 56). 


Between fumigants (single level): 
(3 4.4.) 


(1066)? + (928)? i (1431)? + (892)? — SA = 45,461 


Between fumigants (double level): 
(3 d.f.) 
2 2 2 2 
(1265)? + (877)? + (1241)? + (1122)* (4505) _ o yy 


4 16 
5858)? 2 2 2 
Between levels: (585895: (4817): (Бору 080) а: 88,347 
(2 4.4.) 16 48 


The three sums of squares add to 157,449, in agreement with the treat- 
ments s.s., 157,448, as given in table 3.3, р. 57. 
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The next step is to divide the sum of squares between levels into a 
component representing the average response to fumigation and one 
representing the deviation from a linear response. Ву least squares 
theory, the former is given by a comparison of the double level with the 
zero level. That is, we may take 

zı = Sp — So = 4505 — 5858 = — 1353 
where the subscript refers to the level. The deviation from linearity is 
measured by the orthogonal comparison 
22 = (S2 - Si) — (Si — So) = б + So — 28) 
= 4505 + 5858 — 2(4317) = 1729 

Since each S is a total over 16 plots, the divisors are 32 and 96 re- 

spectively. Hence the contributions to the sum of squares are: 


(1353)? 


Average linear response: = 57,207 


(1729)? 


Average curvature: = 31,140 


The subdivision found thus far might be presented as shown below. 


d.f. 8.8. m.s. 
Between treatments 8 157,449 19,681 
Average linear response 1 57,207 57,207 
Average curvature 1 31,140 31,140 
Between single levels of fumigants 3 45,461 15,154 
Between double levels of fumigants 3 23,641 7,880 


This is not the subdivision envisaged at the beginning of this section. 
Instead, it was proposed to divide the final 6 d.f. into 3 representing dif- 
ferences in the responses to individual fumigants and 3 representing dif- 
ferences in the curvatures of the individual response curves. This 
separation is more difficult. 

Consider a comparison of (CN) and (CS). Estimates of the difference 
between these fumigants are available both at the single (ICN) and the 
double (2CN) level. If the effects of both fumigants are proportional 
to the amounts of dressing, the true difference at the double level will 
be twice that at the single level. On this assumption, two independent 
estimates of the difference between (CN) and (CS), for a single dressing, 
are 


KICN) — (1С8)] and [Sm 2908) 
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The second estimate is considerably more accurate than the first, since 
its variance is only № as large. Statistical theory shows that such 
estimates are combined by weighting each inversely as its variance. 

Hence, on the assumption of lincarity, the most accurate estimate of 
the difference per unit dressing is 
(1СМ) — (1C8)] + (HCN) — (2С5)] 

1+4 
(1СХ) - (1CS)] + 2[(2CN) — (2CS)] 


= : (3.18) 


This shows that differences in the linear responses to the 4 fumigants 


are measured by comparisons of the quantities (ICN) + 2(2CN), ete. 
"These quantities are given below. 


(Single level) + 2(double level) 


(CN) (CS) (CM) (CK) "Total 
3596 2682 3913 3136 13,327 


It is for quantities of this kind that rule 4c is useful. This rule shows that 
the sum of squares of deviations of these quantities, when divided by 
72 l2, or (4) [(1)? + (2)%], i.e., 20, is а component of the treatments s.s. 
This gives 43,408, representing differences in linear responses to the 
fumigants. 

We now consider the final three components. The curvature of an 
individual response curve is measured by a comparison of the type 


(2СХ) — (LCN)] — [(1CN) - (0)] = (9СХ) + (0) - 2(1€N)] 


where (0) represents the total number of eelworms over 4 plots for the 
control treatment. When the difference between two fumigants is taken, 
the (0) term disappears and we obtain the comparison 


[(2CN) — 2(1CN)] — [(2CS) — 2(1С5)] (3.19) 


It is easy to verify that this is orthogonal with comparisons like (3.18). 
Consequently differences in curvature are compared by means of the 
quantities shown below. 


2(Single level) — (double level) 
(CN) (CS) (CM) (CK) Total 
867 979 1621 662 4129 


Rule 4c again applies. The divisor is 20, and the sum of Squares con- 
tributes 25,693. The final separation is given in table 3.5. It is evident 
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from F-tests that none of the mean squares is significant at the 5% level, 
and only that for the average linear response approaches near to the 
5% level. These results must not be taken as final conclusions for this 
experiment. As will be seen in section 3.8, additional data were recorded 
in this experiment which permit a more accurate analysis. 


TABLE 3.5 SUBDIVISION OF THE TREATMENTS 8.8. FOR THE EELWORM DATA 


Source of variation d.f. 8.3. m.s. 
Average linear response 1 57,207 57,207 
Average curvature 1 31,140 31,140 
Differences in linear response 3 43,408 14,469 
Differences in curvature 3 25,693 8,564 
Error 36 544,690 15,130 


A more extensive discussion of this general topic, with a number of 
examples, is given in Snedecor’s book (3.4), chapter 15. It is worth re- 
peating that the amount and type of subdivision that should be done de- 
pend on the experiment, and to some extent on individual taste. As will 
be seen in the next section, any single component can be tested by means 
of a t-test derived from the treatment means, and some workers prefer to 
make the test in this way. 


3.5 Calculation of Standard Errors for Comparisons among 
Treatment Means 


3.51 Rules. We first give a rule (rule 5) that is more general than 
we need, because it is sometimes useful for other purposes. 
Rule б. The standard error of any linear function 


2 = hy + lye +-+- + Бух 
of the individual observations is 


0: = с Vl? +12 +... Ly? 


Тһе estimated standard error is obtained by substituting s for о, where s 
is the square root of the error m.s. in the analysis of variance. This rule 
supplies the standard error of any kind of linear function, provided that 
it has been expressed in terms of the individual observations. For linear 
e uc of the treatment means, one of two special cases of the rule is 
used. 


Rule ба. И all treatments have the same number r of replications, the 
standard error of a linear function 


z = hi + 9 +---+ hig 
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of the treatment means 7; is 


а 
“Жл VL? +? eoe 12 


Rule 5b. И the ith treatment has r; replications. the standard error of 
zis 


As in the general case, we substitute s for о in order to obtain the esti- 
mated standard error. Finally, a rule that may save labor in complex 
cases is as follows. 

Rule 6. И the linear functions 21, 22, * ++, 2р are mutually orthogonal, 
the standard error of 


2 = Ца + 1525 +-+ 12 


is 


д: = М Цор? + lo? +--+ + 1,70,” 


where c;? is the variance of z;. 

This rule enables the work to be done in two stages. Sometimes, in a 
large experiment, it is relatively easy to express the function z in terms 
of a number of familiar orthogonal functions, but rather tedious to write 
2 in terms of original observations. The variances of the 2; may be found 
by rule 5 and substituted in rule 6. Note that different z; may involve 
the same set of observations, provided that the z: are orthogonal. 


3.52 Examples. These rules will now be illustrated by application to 
the eelworm experiment. The treatment means are shown in table 3.6, 
the figures in parentheses denoting the order in which the treatments are 
numbered. 


TABLE 3.6 MEAN NUMBERS OF EELWORMS PER PLOT 


Fumigant 
Level (CN) (CS) (CM) (CK) Means 
0 366(9) 306 
1 266(5) 232(6) 358(7) 223(8) 270 
2 316(1) 219(2) 310(3) 280(4) 281 


From table 3.5 the standard error per plot is У (15,130), or s = 123 
with 36 d.f. 
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Example 1. Standard error of the average linear response. This is 
measured by the average response to the double dressing, 1.е., 


z = li + дг + ӱз + 9) — Jo = —85 


Since the first 4 means are based on 4 replicates, while yp has 16 repli- 
cates, we apply rule 5b, which gives 


wn) GS) GG) Ga) Goa) Go) 


=— = 43.5 
8 


The value of Student’s ¢ is 85/43.5, or 1.95. 

As mentioned previously, the t-test of any comparison, as made above, 
is identical with the F-test of the corresponding component in the 
analysis of variance. The basic relationship is F = £^; that is, if the 5% 
values of ¢ for n degrees of freedom are read from the table and squared, 
their squares are the 595 F values for 1 and п degrees of freedom, and 
similarly for any other significance level. This result may be verified in 
the present instance. The value of ? is 3.80, with 36 d.f. From the 
analysis of variance in table 3.5, the F value for the average linear re- 
sponse is 57,207/15,130 ог 3.78, with 1 and 36 d.f., the two values 
agreeing apart from rounding errors. The reader may check that the 
same agreement holds for the average curvature. 

Example 2. Response to the single dressing of (CN). The response is 
measured by 


z= ys — yo 
Once again rule 56 is appropriate. 
Т 1 ізУ5 
= 123 „|-+— = : - 
4 16 4 


Example 8. Difference between the.linear responses to (CN) and (CS). 
In section 3.43 it was shown that this difference is 


- [95 — Je + 291 — 292] 
5 


Since all means have 4 replicates, rule 5a may be used. 


а= 


1+1+4+4 = (123)(V10) = 389 


$, = 


(2)(5) 
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Example 4. Although the experiment is not complex enough to ex- 
hibit a profitable application of rule 6, this example shows that the rule 
agrees with the other rules. Suppose that at the double level we had 
compared (CN) with (CS), and also the two chemicals with the two 
proprietary mixtures. The comparisons are 


ИР» ӱз + 


£90]! zw 2 2 
By rule 5a their standard errors are 
ГД т 
asi = 52; в = 5 


and it will be noted that the two functions аге orthogonal. 
If now we wished to compare the double dressing of (CN) with the 
mean of the 3 other double dressings, the comparison would be 


pag BEBE 
3 
By a direct use of rule 5a, we have 
с 4): 5nd c 
ШАҒА ltgtgtg^ và 
But alternatively we may write 
2a 2% 
T қақ % 


and apply rule 6 to give 


- este -- 99400 -- 


in agreement with the more direct method. 


3.53 Testing Effects Suggested by the Data. In order that F- and t- 
tests be valid, the tests to be made in an experiment should be chosen 
before the results have been inspected. The reason for this is not hard 
{о вее. If tests are selected after inspection of the data, there is a natural 
tendency to select comparisons that appear to give large differences. 
Now large apparent differences may arise either because there are large 
real effects, or because of a fortuitous combination of the experimental 
errors. Consequently, in so far as differences are selected just because 
they seem to be large, it is likely that an undue proportion of the cases 


74 NOTES ON THE STATISTICAL ANALYSIS ОЕ THE RESULTS 3.53 


selected will be those where the errors have combined to make the dif- 
ferences large. The extreme case most commonly cited is that of the ex- 
perimenter who always tests, by an ordinary t-test, the difference be- 
tween the highest and the lowest treatment means. If the number of 
treatments is large, this difference will be substantial even when the 
treatments produce no real differences in effect. It may be shown that 
with 3 treatments the observed value of ¢ will exceed the 5% level in the 
table about 13% of the time. With 6 treatments the figure is 40%, with 
10 treatments 60%, and with 20 treatments 90%. When the experi- 
menter thinks that he is making a t-test at the 5% level, he is actually 
testing at the 13% level, or the 40% level, and so on. To summarize, 
the selection of those differences that look large and therefore “inter- 
esting” invalidates the ordinary tests of significance. The effect is to 
obtain too many significant results, or to raise the significance level of 
the test from the presumed 5% to some higher level, usually unknown. 

Sometimes the difficulty can be resolved by using for the test of sig- 
nificance a special table which takes account of the selection that has 
been exercised. For the problem discussed above, there is a table of the 
significance levels of the ratio of the range of a group of treatment means 
to an independent estimate of the standard error of a treatment mean. 
This ratio is called the studentized range (3.1а). The table furnishes a 
fully valid test of significance of the difference between the highest and 
the lowest treatment means. More generally, the table gives a con- 
servative test of the difference between any pair of treatment means 
that are picked out because their difference looks interesting or large. 

Occasionally, one treatment mean is found to differ by an unexpectedly 
large amount from all the other treatment means, In this event we may 
want to test the difference between the aberrant mean and the average 
of all the other treatment means, with due allowance for the fact that 
the largest difference of this type has been selected. For this, the table 
of the extreme studentized deviate from the sample mean can be used 
(3.22). In our notation this is a table of the significance levels of the 
absolute value of 

(и — 9)/s 


where ў, is the outlying mean, 7 is the mean of all the treatments, and s' 
is the estimated standard error of a treatment mean. Another useful 
table (3.32) applies when a group of treatment means are all being com- 
pared with a standard treatment, 

The most general test of this type (3.3, 3.42) applies to any linear 
Comparison among the treatment, totals (or means) that has been picked 
out after inspection of the results, Tf г is the comparison, in the notation 
of section 3.42, the standard error of г is of the form sVD, where s is 
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the estimated standard deviation рег unit, based on n d.f. The sig- 
nificance level of the ratio z/s4/D сап be taken as V(t — ПР, where t 
is the number of treatments and F has (4- 1) and n d.f. 

To illustrate, suppose that the treatment totals over 4 replicates are as 


follows: 
T, T» Т; Т, Т; 
16.2 17.1 18.4 24.9 25.8 


The experimenter is interested in the unexpected disparity between 
the pair T4, T; and the rest of the treatments. He takes 


2 = 3(T4 + Т5) — 2(Т, + Т» + Т) = 47.2 
By the rule in section 3.42, 
Ds 4(3? + 3? + 2? 4 22 + 22) = 120 
If s = 1.23, with 12 d.f., 
2 47.2 


sV/D (1297190 — 


For 4 and 12 d.f., the 5% level of F is 3.26. The 5% significance level 
of this ratio is therefore \/ (4)(8.26) = 3.61. Тһе comparison іп ques- 
tion falls just short of significance. 

This test requires high ratios in order to attain significance, because 
it allows for the possibility that we may have picked out the comparison 
z for which z/s\/D has the highest attainable value. If the comparison 
is the difference between а, pair of treatments, or between one treatment, 
and the remainder, the tests described earlier are more sensitive, 


3.50 


3.54a Multiple Comparisons. In an experiment with more than two 
treatments, the investigator frequently makes several 5% t-tests in the 
process of summarizing the results. If there are no differences between 
the true treatment means, the probability of obtaining an apparently 
significant result in one of these tests is 0.05. Tt has long been гесор- 
nized, however, that if several t-tests have been performed, the prob- 
ability that at least one of these is apparently significant is greater than 
0.05. If the t-tests are independent, this probability is 0.23 for 5 tests, 
0.40 for 10 tests, and 0.64 for 20 tests. These results raise the question 
whether we ought to try to control the frequency with which any wrong 
statement is made in the summary of an experiment, rather than the 
frequency with which a wrong statement is made in an individual t-test. 
The same issue appears when several confidence intervals are com- 
puted. The probability that at least one of the intervals does not en- 
close the true treatment difference is greater than the stipulated 0.05. 
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Recent research (3.52, 3.6a, and 3.7a) has clarified this and related 
questions. One problem to which the research has contributed is that 
of arranging a set of treatment means in order of performance. Since 
significant differences rarely exist between every pair of treatments, the 
most that can usually be done is to create a partial separation, such as, 
with four treatments, 


A с В р 


The line joining A and С denotes that these two treatments have not 
been shown to be significantly different; A is, however, significantly 
different from В and D. Similarly, С differs significantly from D but 
not from B, and B differs significantly from D. 

Іп the past, this separation has often been made by first performing 
an F-test on all the treatment means. If F is not significant, no t-tests 
are made, the four means being regarded as indistinguishable. If F is 
significant, the ordinary t-test for the difference between two means is 
applied to every pair of means. It saves time to compute the least 


significant difference, V 2st os/ Vr. Any two means whose difference ex- 
ceeds this value are declared significantly different. 

Duncan (3.5a) has examined the type of protection which this method 
gives the experimenter against erroneously finding significant results. 
If all four true means are in fact identical, the F-test assures 95 chances 
in 100 that the means will be declared indistinguishable. If a specified 
pair of true means are identical, the combined F- and t-tests give a 
protection level of at least 95 chances in 100 that the means will be 
declared indistinguishable. If, however, three of the true means are 
identical, the protection level for the trio depends on the position of the 
fourth mean. When the fourth mean is far removed from the three 
identical means, the level is about 87%. As the fourth mean moves 
nearer to the other three, the level lies between 87% and 95%. 

In the newer methods, the protection levels for sets of two, three, 
four, etc., means are kept at or above preselected values. The Newman- 
Keuls method (3.5a) keeps all levels at 95%. Duncan’s multiple range 
test (3.5a) makes the level 95% for sets of two means. For sets of 
three means, the level is 90.25%, i.e. (0.95)?; for sets of four means it 
is (0.95)', or 85.7%, and so оп. This decrease in the level of protection 
makes the multiple range test more sensitive than the Newman-Keuls 
test in detecting real differences, at the expense of lowered protection. 
As Duncan suggests, his test appears to be a reasonable working com- 
promise for the experimenter. 
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8.6 Subdivision of the Sum of Squares for Error 


3.61 Reasons for Subdivision. The sum of squares for error can be 
partitioned into components in the Same way as that for treatments, 
Although such subdivisions are not so frequently required as with treat- 
ments, they have a number of uses. Sometimes there is reason to believe 
that the error s.s. is not homogeneous; that is, the residual errors е do 
not all have the same variance о? as postulated in the mathematical 
model. In this case, as will be shown in section 3.63, subdivisions of the 
error may be necessary in order to obtain valid t-tests, Occasionally a 
gross error in some observation may be suspected, and it is helpful to 
calculate the contribution of this observation to the error s.s. In ad- 
dition, a subdivision may help in understanding the nature of the error 
m.s. 


3.62 Rules for Subdivision. These follow the same general pattern 
as with treatments, though they are a trifle more complicated. Tt is 
necessary to go back to the normal equations (3.4) from which the treat- 
ment and block effects were estimated. ‘These equations took the form 
DX (vin —m — t; — b) =0 (3.4) 
In other words, the residuals that provide the estimates of error must 
add to zero over any treatment or over any block. Consequently, if the 
linear function 
2 = Y gig 
is to be a component of the error, the coefficients J; must sum to zero 
over all observations that receive any specified treatment and also over 
all that аге in any specified block. These are the tests by which we tell 
whether any proposed z is part of the error. 
For any z which satisfies these tests, the contribution to the error s.s, 
is z?/D, where by rule 5 
D= Die? 


We now examine some components of the error s.s. in the eelworm ex- 
periment. Obviously, the difference z between the eelworm numbers on 
any two control plots in the same block satisfies the conditions. Sim- 
ilarly any comparison among the 4 control observations in a block is a 
component of the error. It follows that the sum of squares of deviations 
of these 4 observations from their mean contributes 3 d.f. to the error 
8.8. The 4 blocks together contribute 12 d.f. of this type of error. The 
Bum of squares will be found to be 307,312. This component is a meas- 
ure of the amount of variation among observations that receive the same 
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treatment and lie in the same block, and is the type of component that 
we should expect to constitute the error. However, this type of com- 
ponent is encountered only when some treatments have been replicated 
within the block. 
Consider now the difference between two other treatments, say (ICN) 
and (105). Let 
zı = (1CN)ı — (1CS)1 


where the subscript 1 denotes that the two observations come from 
block 1. This function is not a component of the error, for, while the 
coefficients add to zero over every block, they do not do so over treat- 
ments (1CN) and (108). But if we take 


zı — 22 = (10№), — (108) — (LCN) 2 + (1С5)» 


the conditions are satisfied. This expression shows how the difference 
between two treatments changes from one block to another. It is a 
measure of the effect of the blocks on the treatment difference, and is 
usually called an interaction of treatments with blocks. Such inter- 
actions are the typical components of error. 

Tf z; is the difference between (1CN) and (1CS) in the ith block, any 
comparison among the 2; satisfies the rules required for a component of 
the error. Hence the sum of squares of deviations of the z;, when divided 
by 2, contributes 3 d.f. to the error 8.3. А similar result holds if we take 
any comparison among the treatments and calculate it separately for 
each block. Since the 9 treatments provide 8 independent comparisons, 
we see that the 24 d.f. for the interactions of treatments with blocks may 
be divided into 8 sets of 3 d.f. The sum of squares for these 24 d.f. can 
of course be found by subtraction as shown in table 3.7. 


TABLE 3.7 SUBDIVISION OF THE SUM OF SQUARES FOR ERROR 


d.f. 8.8. m.s. 
Error 36 544,690 15,130 
Among controls 12 307,312 25,609 
Treatments X blocks 24 237,378 9,891 


The F-ratio of the two components of error is 2.59, with 12 and 24 d.f. In 
the ordinary table this corresponds to a significance level of about 214%. 
It is probably more correct to make a two-tailed test by doubling this 
probability. Either test indicates that the controls are more variable 
than the other treatments. This is not very surprising, since in data of 
this type the variance may tend to increase as the mean increases. In 
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any event the result throws doubt on the suitability of the original 
mathematical model; it would seem better to ascribe different error 
variances to the control and the fumigants. 

The control also contributes to the treatments X blocks component 
of the error. The 8 comparisons among the 9 treatments can be divided 
into 1 between the control and all fumigants, and 7 which are com- 
parisons among the fumigants themselves. The former will contribute 
З df. to the treatments X blocks interaction; the latter, 21 d.f. If the 
control is more variable than the fumigants, we would expect the mean 
square for the 3 d.f. to be larger than that for the 21 d.f., though it should 
be remembered that a mean square with only 3 4.4. is poorly determined. 
In the next section these two components will be computed separately. 


3.63 Calculation of a Separate Error for а Treatment Comparison. Ав 
the eelworm example illustrates, certain treatments may be erratic in 
their effects, while others in the same experiment show more stable re- 
sults. When this occurs, comparisons involving the erratic treatments 
have a higher experimental error variance than those among the more 
stable treatments. Since the error m.s. in the analysis of variance is some 
weighted average of these different variances, its use for individual t- 
tests may not be valid. In such cases it is helpful to be able to caleulate 
a separate error for any specific treatment comparison. The procedure 
will be illustrated for the comparison between the control and the 
fumigants. 

The first step is to compute the comparison separately for each block. 


Totals in block 


1 2 3 4 Total 
Control 1814 1295 1769 980 5858 
Fumigants 2569 1780 2983 1490 8822 
2 (Control) —(Fumigants) 1059 810 555 470 2894 


Since there are 4 control plots and 8 fumigated plots in a block, the con- 
trol total is multiplied by 2 in forming the comparison. The contribu- 
tion to the error s.s. is 


(1059)? + (810)? + (555)? + (470)? (2894)? 
24 96 
The divisor 24 is found by the usual rule. The mean square is 2954, 


with 3 d.f. By subtraction, the mean square for the interaction of fu- 
migants with blocks (21 4.1.) is found to be 10,882. Contrary to an- 


= 8862 
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ticipation, the component involving the control has the smaller mean 
square, though the difference does not approach significance. 

As often happens when the simple model is inadequate, the best 
approach for a more accurate analysis is questionable. It is possible 
to compute a separate error, by the method just given, for each t-test 
that is made. This is the procedure least open to criticism, since it 
makes no assumption that errors are constant from one treatment to 
another. Since, however, each t-test would be based on only 3 d.f., 
these tests are insensitive, so that this method should not be used with- 
out good reason. In the present case it is probably justifiable to pos- 
tulate only two error variances. The first, appropriate to the controls, 
is estimated by the mean square 25,609, with 12 d.f.; the second, for the 
fumigants, by the mean square 10,882, with 21 d.f. By this approach 
the variance of the mean response to the double dressing, for example, 
is estimated as 

10,882 + 25,609 a 
16 16 i 
since each mean is taken over 16 plots. 

Separation of the error s.s. into single components is seldom required 
except in special studies or as an exercise. One method is to set up 8 
orthogonal comparisons among the treatments. Calculate each com- 
parison separately in each block, and let z;; be the value obtained for the 
ith comparison ($ = 1, 2, -.-, 8) in the jth block. Then find 


Wit = 20 — Ziz) Wig = м + Ziz - 223; 
Wig = 21 + 22 + 23 — Зги 


Each w is a component of the error, and the whole 24 w’s are mutually 
orthogonal. The squares of the w’s, each with its proper divisor, give a 
separation of the error s.s. into single components. 


3.7 Missing Data 


3.71 Method of Handling Missing Data in the Analysis. From time 
to time certain observations are missing, through failure to record, gross 
errors in recording, or accidents. The omissions naturally affect the 
method of analysis. With each of the common designs we give computa- 
tional instructions for analyzing data that contain gaps. The object of 
this section is to indicate the theoretical basis for these methods. 
When certain observations are absent, the correct procedure is to 
write down a mathematical model for all observations that are present. 
The least squares normal equations are then constructed in the usual 
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way. These take exactly the same general form as when all observa- 
tions are present; i.e., 

X (иљ — m — ti — bj) = 0 (3.20) 
over all observations whose equations contain any specified parameter 
that is to be estimated. Since, however, thé terms іп the equation cor- 
responding to missing observations are absent, the system of equations 
loses some of the symmetry that it possesses when all observations are 
present, and the solutions are more difficult. The same general procedure 
supplies F- and t-tests of hypotheses about the nature of the treatment 
effects as described in section 3.24, though again the details become more 
complicated. 

In the analysis of variance, two changes may be noted. Owing to 
the missing observations, the treatments and blocks s.s. become en- 
tangled, so that the treatments s.s. must be computed after allowing 
for block effects, as mentioned at the end of section 3.26. Secondly, 
if а observations are absent, che total number of degrees of freedom 
is reduced by a. Unless one or more complete treatments or blocks 
is missing, the number of parameters required to describe these effects 
will be the same as before. Consequently, the missing degrees of free- 
dom all come from the error s.s., which now represents (n, — a) degrees 
of freedom. Та short, missing data may be handled by applying the 
standard least squares procedure to all observations that are not miss- 
ing. For future reference, this method will be called the "correct least 
squares procedure." 

To the experimenter it may be a difficult business to carry out the 
construction and solution of a set of unfamiliar normal equations, even 
though he is quite competent to analyze a set of complete data. For this 
reason Yates (3.9), following a suggestion by Fisher, considered inserting 
values for the missing observations so as to obtain a set of complete data. 
Suppose that only a single observation is missing, and that a value z is 
substituted for this observation. If the analysis of variance is calculated 
in the usual manner for complete data, the error s.s. is found to be of the 
form 


Аа? — 2Br + С 


where А, В, and С are numbers that are determined by the type of de- 
sign and the values for the other observations (A is always positive). 

In order to find a numerical value for т, Yates proposed to use the 
value that minimizes the error s.s. This ist = B/A. If this value is in- 
serted in place of the missing observation, and if the data are analyzed 
as if no observations were absent, Yates showed that several important 
properties hold. (i) The estimates of treatment and block effects are 
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exactly the same as those obtained by the correct least squares pro- 
cedure. (ii) The error s.s. is exactly the same as given by the correct 
procedure. (iii) To obtain the correct partition of the degrees of freedom, 
we subtract 1 from the total s.s. and 1 from the error 5.5. 

Yates also showed that the method of insertion fails to agree with the 
correct least squares procedure in two respects. The treatments s.s., as 
obtained in the analysis of variance of the “eomplete” data, is always 
slightly larger than the correct treatments s.s. for an F-test of the treat- 
ments. Unless an appreciable fraction of the total observations is miss- 
ing, this overestimation is unlikely to be large; further, the exact F-test 
can be obtained by means of some additional calculations. The second 
defect of the method of insertion is that it may not give proper t-tests. 
That this will happen is clear, because in the analysis of “complete” 
data т replications are ascribed to the treatment that contains the missing 
observation, whereas there are only (r — 1) replications. To allow for 
this disturbance we give special rules which provide ¢-tests that are ap- 
proximately correct. 

Thus the method adopted with a missing value is first to “estimate” 
this value by means of the formula B/A, which will be presented for the 
common designs. This estimate is used in place of the missing value, and 
the rest of the analysis is conducted as if the data were complete (except 
for the changes in degrees of freedom). Special methods are available 
for exact F-tests and for t-tests. If several observations are absent, a 
repeated application of the formula enables values to be substituted for 
each missing observation. 

This method is essentially an ingenious computational device whose 
purpose is to enable the easy computations that apply to complete data 
to be used even when data are incomplete. Substitution of estimates 
for the missing data does not in any way recover the information that is 
lost through loss of data, as some experimenters have suggested, usually 
facetiously; it merely attempts to reproduce the results obtained by an 
application of the least squares method to the data that are present. 
The only complete solution of the “missing data” problem is not to have 
them. 


3.8 The Analysis of Covariance 


3.81 Purpose of the Technique. As indicated in section 3.2, experi- 
ments can be planned so that certain types of environmental effect are 
eliminated from the estimates of the treatment effects, with the result 
that these estimates are made more accurate. In the eelworm experi- 
ment, where the plots were grouped into blocks of 12 plots each, any dif- 
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ferences from block to block in the severity of eelworm infestation were 
eliminated in this way. On the other hand, differences in infestation 
from plot to plot within the same block are not controlled by the design and 
do contribute to the experimental errors, since treatments were assigned 
at random to plots within each block. Accordingly, before the fumigants 
were applied, samples were taken in order to estimate the natural in- 
festation on each plot. 

The analysis of covariance shows how to use these supplementary data 
to reduce the experimental errors by eliminating the effects of variations 
in the initial infestation within a block. The technique is potentially 
very useful. It often happens that some source of variation which can- 
not be controlled by the design can be measured by taking additional 
observations. Whenever this is so, the analysis of covariance can be 
utilized, often to great advantage. One caution is that, since the ad- 
ditional observations are to measure environmental effects, they must not 
be influenced by the treatments. The situation where such measure- 
ments are influenced by the treatments is discussed in section 3.88. 


3.82 Initial Steps in the Analysis. The first step is to construct a 
new mathematical model. If y;;; refers to the final eelworm count and 
Tijg to the initial, the relation is 


ук = w+ ri + Bj + (Rise — ® еж (3.21) 


The only change is the introduction of a new term to describe the effect 
of the initial eelvorm number. We have assumed that the effect is 
linear, i.e., it is a constant multiple у of the amount by which the initial 
eelworm number z;;; on the plot differs from the average initial number 
2 for the whole experiment. 

As before, the unknowns are estimated by least squares. In this 
case we minimize 


Dd line — m — t; — bj -с@ — OP (3.22) 
The normal equation for m is again of the form 
D [vin — m — ti — bj — сад — 8] = 0 (3.23) 


over all observations. If the same linear restrictions as before are ар- 
plied to the ¢; and the b;, equation (3.23) implies that m is the mean of the 
Vij. The equation for t; is the same as (3.23) except that the sum is over 
those observations that receive t; The equation may be rearranged as 


rm + та; = Та, — «(Т; - тұ?) (3.24) 
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where т; is the number of replications, and Т denotes a treatment total. 
This gives 


В "Тер 
mte 2E (A8) (3.25) 


The important feature of this equation is that in order to obtain t;, the 
observed treatment mean of ће y's (T;,/rj) is adjusted, the adjustment 
depending on the treatment mean of the z's. It is this adjustment that 
removes the effect of the initial infestation. 

The equation for с, the adjustment factor, will not be developed in 
full. It turns out that c is given by the ratio of the error sum of products 
of y and z to the error s.s. of x. 

3.83 Computations. In practice we start with a joint analysis of the 
sums of squares and products of y and z. In order to illustrate certain 
features of the covariance technique, the treatments s.s. will be sub- 
divided into "linear" and “curvature” components, as was done for y 
in table 3.5 (p. 70). The original values for z are given above the y 
values in table 3.1 (p. 46). The treatment totals and the quantities 


TABLE 3.8 TREATMENT TOTALS FOR NUMBERS OF CYSTS * 


Before fumigation (x) After fumigation (y) 


Level CN CS CM CK Totd| CN CS CM СК Tota 


0 1975 1975 5858 5858 
1 402 417 513 570 1902 | 1066 928 1431 802 4317 
2 880 554 568 778 2289 | 1265 877 1241 1122 4505 


2(2) + (1):L | 1180 1525 1649 2126 6480 | 3596 2682 3913 3136 13327 
2(1) —(2:Q| 415 280 458 362 1515 | 867 979 1621 662 4129 


* Numbers of cysts per plot were the numbers found in 400 grams of soil. Thus 


the totals in the table represent 1600 grams of soil, except for the no-fumigant totals, 
which represent 6400 grams. 


needed for isolating the linear response and the curvature are reproduced 
in table 3.8. The analysis of the sum of Squares of z follows by the 
same methods as given previously for y. 

To analyze the sum of products, we carry out the same operations as 
for a sum of squares, except that at every stage a square is replaced by 
the corresponding product. A few examples will suffice. 
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Total: (269)(466) + (283)(280) +- - -+ (48)(298) — eso) = 355,929 
Т (1975)(5858) (402)(1066) Рае (778)(1122) 
16 4 4 
“ nd = —9222 
Average linear response: (2280. 1979 (4508. 1509) = —13,276 


32 


The complete analysis of covariance is shown in table 3.9. 


TABLE 3.9 Sums оғ SQUARES AND PRODUCTS (т = BEFORE, у = AFTER, 


FUMIGATION) 

df. (ғ? (yz) 0) 
Blocks 3 159,618 175,873 289,427 
Treatments 8 29,142 —9,222 157,448 
Average linear response 1 3,081 —13,276 57,207 
Average curvature L 2,204 8,285 31,140 
Differences in linear 3 22,975 —6,837 43,408 
Differences in curvature Б] 882 2,606 25,698 
Error 36 121,408 189,278 544,690 
Total 47 310,168 355,929 991,565 


If we denote the sums of squares and products for error by Ezz, Eyy 
and Ёл, respectively, the adjustment factor с, or regression coefficient 
of y on z, is given by 

Eyz 189,278 


c = -Z = ———— = 1.559024 
E, 191,408 


It may seem surprising that the coefficient is greater than 1. This is 
probably explained by the great seasonal increase in the eelworm num- 
bers, obvious from table 3.8. 
The residual error s.s. may now be found. The original error s.s. is 
544,690, with 36 d.f. To remove the effect of the regression on the 
initial eelworm numbers, we subtract 
Е,2 (189,278)? 
E. 121408 
By? 
Ezz 

Thus the residual s.s. is 249,601, with 35 d.f., since 1 d.f. must be sub- 


= 295,089 


Hy, — —— = 544,690 — 295,089 = 249,601 
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tracted for the additional parameter y. The residual m.s. Syz? is 
249,601/35, or 7131. This is less than half the original mean square 
of 15,130, and indicates that the use of covariance has approximately 
doubled the accuracy of the experiment. 

One feature of the covariance method deserves comment at this point. 
In the model the effect of x was assumed to be linear, but no assumption 
was made about the strength of the effect. That is left to be determined 
by the data. If the eelworm numbers on each plot had remained un- 
changed throughout the season, except for the influence of the treat- 
ments, and if the eelworm numbers had been accurately measured, the 
residual s.s. would have been zero. The residual s.s. as found presumably 
represents а contribution due to seasonal variations in infestation and 
one due to the fact that the eelworm numbers were not estimated ac- 
curately, being obtained only from small samples of soil. If the variable 
z had had no linear effect on y, the residual m.s. would have been the 
same as the original mean square, apart from sampling fluctuations. 


3.84 The Adjusted Treatment Means. А little time is saved if the 
adjusted treatment totals are found first. From equation (3.24) the 
adjustment to the 7th treatment total of y is 


-«<(Т,- rii) 


where r; is the number of replicates, and 2 the general mean of x (128.46). 
Thus for the control the adjusted total is 


5858 — 1.5590(1975 — 16 X 128.46) = 5858 + 125 = 5983 
and for (1CN) 
1066 — 1.5590(402 — 4 х 128.46) = 1066 + 174 = 1240 


The means are shown in table 3.10. 


TABLE 3.10 ADJUSTED TREATMENT MEANS (EELWORMS PER 400 ом. воп,) 


Fumigant 
Level of с 
dressing CN Cs CM CK 
0 374 
1 310 270 358 201 
2 365 204 289 178 


3.85 t-tests. These tests are slightly complicated by the fact that 
account must be taken of the sampling error of the adjustment factor с. 
The difference between two adjusted means may be written 


йз — Yo — c(% — 22) 
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From regression theory the variance of this quantity is given by 

1 (Һ-%ы) 
+ 2. 

T2 Ezz 


Y „Л, 1 
буз? (= + ~) + (& — Ehe) = сш? [5 + | (3.26) 


Fe. 
where ду. is the residual error variance, and Ezz the error s.s. for z. Ав 
an estimate of су.:? we use the residual error m.s., in this case 7131 with 
35 d.f. 

Example. A t-test of the reduction to the double dressing of CS. The re- 
duction is (374 — 204), or 170 eelworms per sample. The estimated 
variance is 


1 1 (123.44 — 138.50)? 


| = (7131) (0.31437) = 2242 


121,408 
(3.27) 
the means of the z's being found by division from table 3.8. Hence 
и 170 Е 170 = 3.59 
2 4/92 4735 “ 
with 35 d.f., which is highly significant. 
For a more complicated comparison of the adjusted means, 
È lig: — с — 2)] 
the estimated variance is 
12 ly 
Sya? [= XE a (3.28) 
Ti Ezz 


(The term in & vanishes because for any comparison the sum of the Ёз 
must be zero.) 

One annoying feature of these tests is that the z values enter into the 
variance, so that every comparison necessitates a separate computation 
of the variance. As a time-saving approximation, Finney (3.10) has 
proposed that an average value for the contribution of the term in the 
28 may be used. This amounts to using 


А | = (7131)(1.030) = 7345 


і 
28 Æ | = (7131 [ ——— 
2 E t | КыЛ 121,408 


zz 
as the effective residual error m.s., where tsz is the treatments mean square 
for æ. The term in brackets represents the average contribution from 
the z's, or in other words the contribution from sampling errors in the 
adjustment factor c. 
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Thus, for the variance of the difference between the adjusted means 
of the control and 2С8, we would use 


1 Tes 
(7345) (с + ~) = (7345) e + :) = 2295 
Ty 7% 16 4 


instead of the value 2242 given by the more exact expression (3.27). 
Similarly, instead of (3.28), we use 


(7345) У) (3 


This approximation is usually good enough if the number of error de- 
grees of freedom exceeds 20, since in such cases the contribution from 
errors in c is small. The more exact test may be used if n, is small. 


3.86 F-tests. The fact that the same adjustment factor appears in 
every treatment mean also influences F-tests. The procedure is shown 
in table 3.11. 


TABLE 3.11 F-rEST WITH THE ANALYSIS OF COVARIANCE 


Residuals 
————— 
d.f. (а?) (yz) (у) fat. 8.8. m.s. 
Treatments 8 29142 —9,222 157,448 | 8 237,192 29,649 
Error 36 121,408 189,278 544,690 | 35 249,601 7,131 


Т+Е 44 150,550 180056 702,138 | 43 486,793 


The figures to the left of the vertical line are from the previous analysis 
(table 3.9), and in practice would not be recopied. Form a new line in 
the analysis by addition of the items for treatments and error From 
this value for (у?), i.e., 702,138, subtract the contribution due to a re- 
gression on т, 

(180,056)? 


150,550 


Тһе remainder, 486,793, is entered in the column headed “residuals в.в.” 
and carries 43 d.f., 1 being subtracted for the regression. The same 
process is completed for the error line; actually, this was already done in 
computing the residual error ss. The residual ss. for treatments is 
found by subtracting that for error from that for treatments -+ error 


237,192 — 486,793 — 249,601 


= 215,345 
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It always has the same number of degrees of freedom as the original 
treatments s.s. The F-test of the adjusted treatment means is given by 
the ratio of the residual m.s. 


with 8 and 35 d.f. 

If it is desired to test some component of the treatments s.s., the same 
calculation is made with the component in place of the treatments s.s. 
throughout. If several components are to be tested, this becomes rather 
tedious. A useful approximation is to construct from the original anal- 
ysis of covariance in table 3.9 an analysis of (y — cz)?. This is most 
easily done by multiplying each term in (2°) by с?, or 2.43048, each term 
in (yx) by —2c, or —3.11805, and adding the two products to (y?). The 
results are shown in table 3.12. 


TABLE 3.12 ANALYSIS OF VARIANCE OF (y — cz) 
d.f. 8.8. m.s. Е p 


"Treatments 8 257,032 32,129 
Average linear response 1 106,090 106,090 14.88 1451 
Average curvature 1 10,664 10,664 1.50 147 
Differences in linear 8 120,566 40,189 5.64 5.05 
Differences in curvature 3 19,711 6,570 0.92 0.92 
Error 35 249,592 7,131 


It will be noted that this calculation gives the correct residual error 
m.s. However, sums of squares for components of the treatments are 
always larger than those given by the more roundabout correct рго- 
cedure. Thus the F values shown on the right are all too large. The 
overestimation is seldom great, and the F values serve for a preliminary 
inspection. Those that are just beyond the significance level may be 
recomputed by the correct procedure if it is thought worth while. For 
comparison, the F’ values shown above are those obtained by the 
correct procedure (it is а useful exercise to check them). With either 
F or F' there is а significant average linear response and significant 
differences among fumigants in their linear responses, while the curva- 
ture terms do not approach significance. In the analysis made without, 
covariance (table 3.5, p. 70) no component was significant. 


3.87 The Increase in Accuracy Due to Covariance. From table 3.9 
we see that with no covariance the error s.s. for y is 544,690, with 36 
d.f., giving an error m.s. of 15,130. А comparable figure when covariance 
is used is the effective residual m.s., 7345. This is preferable to the 
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residual m.s. itself (7131) because it makes allowance for the sampling 
error of the adjustment factor c. The accuracy obtained with covariance 
relative to that without covariance is estimated by 15,130/7345, or 2.06. 
The use of covariance appears to have had about the same effect as 
doubling the number of replicates. In making this comparison, we ig- 
nored the effect of the reduction in error d.f. from 36 to 35, because it is 
negligible. 

As in ordinary regressions, the т variable may be transformed to an- 
other scale if this is likely to produce a more linear relation with y. The 
use of log z, for instance, is common in biological work. Two or more 
different z variables may be used. The calculations for this case are 
described by Snedecor (3.4, chapter 13). 


3.88 The Case Where the x Variable is Influenced by the Treatments. 
Tn a covariance analysis the treatment mean $: is adjusted by the amount 
—с(5; — 2). The effect of the adjustment is to change each 7; to the 
value that it would be expected to have if all treatments had the same 
т mean. It is in this way that the technique removes the effect of 
variations in the Z;. If, however, the differences among the 2; are in part 
produced by the treatments, the adjustment removes part of the treat- 
ment effect and its interpretation is changed. Consequently, in the 
standard use of covariance, it is important to be sure that the treatments 
did not affect the 2 values. Sometimes this is obvious, as in the eel- 
worm experiment, because the т values were recorded before the treat- 
ments were applied. A more doubtful case is that of the number of 
plants per plot in a field experiment, counted after the application of 
the treatments, which may or may not influence plant numbers. An F- 
test of the z values is helpful in such cases, 

Where the treatments do affect x, a covariance analysis may add in- 
formation about the way in which the treatments produced their effects. 
In the eelworm experiment, the yields of the oats which were grown on 
the plots were obtained. Since eelworms attack oats, it would be in- 
teresting to know whether the effects of the treatments on the oats were 
simply a reflection of their effects on the eelworms. This is examined 
by a covariance analysis in which the oats yields are the y values, and 
the eelworm numbers after harvest are the 2 values. If the F-test of the 
adjusted treatment means still shows significance, the conclusion is that 
not all the treatment effect on the oats can be attributed to the reduction 
in eelworm numbers. This happened in the present instance, because 
some treatments supplied nitrogen to the crop, and therefore acted in 
part as fertilizers as well as fumigants. As Bartlett (3.11) has pointed 
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out, the interpretation of this use of covariance requires care, since a 
hidden extrapolation may be involved. 

When the х variables are influenced by treatments, the short-cut 
method for t-tests by use of the effective residual error and the approxi- 
mate F-tests by means of an analysis of (y — cx) should be avoided, 
since they may be seriously in error. 


3.9 Effects of Errors in the Assumptions Underlying the Analysis of 
Variance 


The assumptions made in the analysis of variance are that treatment 
and environmental effects are additive, and that the experimental errors 
are independently distributed in the normal distribution, with a common 
variance. In practice we can never be sure that these assumptions all 
hold, and often there is good reason to suspect that some are false. The 
consequences of failures in the assumptions and the remedial steps to 
be taken have been summarized by Eisenhart (3.12), Cochran (3.13), 
and Bartlett (3.14). Only a few comments will be given here. 

As a rule, the failure of an assumption will affect both the significance 
levels and the sensitivity of F- and t-tests. When the experimenter 
thinks that he is testing at the 5% level, he may actually be testing at 
the 8% level. Usually, though not invariably, the true significance 
probability is larger than the apparent one; that 13. too many significant 
results are obtained. Also, there is usually a loss of sensitivity, in the 
sense that a more powerful test than the analysis of variance F-test 
could be constructed if the correct mathematical model were known. 
There is a corresponding loss of accuracy in the estimates obtained for 
the treatment effects, since these, too, could be made more accurate if the 
correct model were known. 

Although generalization is hazardous, experience suggests that in the 
majority of experiments, at least in the field of biology, these disturb- 
ances are not sufficiently great to invalidate the technique. They do 
imply, however, that significance levels and confidence limits must be 
considered approximate rather than exact. For the same reason, the 
inflexible use of say the 5% significance level to divide the effects into 
those that are regarded as “real” and those that are not is hardly jus- 
tifiable. 

The most serious disturbances appear to arise when the experimental 
error variance is not constant over all observations. Sometimes this 
happens, as mentioned previously, because certain treatments are erratic 
in their effects. In such cases, the appropriate error variance for com- 
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paring one pair of treatments might be four times as large as that for 
another pair, and the use of the same estimated variance for both com- 
parisons would lead to t-tests that were completely erroneous. Where 
this type of disturbance is suspected, the remedy is to divide the error 
8.3. into components each of which is homogeneous, as indicated in 
section 3.63. 

The same problem may arise because the experimental errors follow a 
distribution that is decidedly skew. In such distributions the error 
variance for a treatment tends to be a function of the mean produced by 
the treatment. If the nature of the functional relationship is known, a 
transformation can be found that will place the data on a scale on which 
the error variance is more nearly constant. This transformation is then 
made on the observations before starting the analysis. The principal 
transformations that have been found useful are discussed by Bartlett 
(3.14); they include logs, square roots, and (for data expressed in frac- 
tions) inverse sines. 

Such transformations may also be useful in cases where treatment and 
environmental effects are not additive. If, for instance, a treatment in- 
creases all observations by 20%, irrespective of the initial level, a change 
to logs will introduce additivity. When transformations are made for 
this purpose, it should be realized that they will also affect the distribu- 
tion of the experimental errors. Fortunately, it often happens that such 
transformations also bring the distribution of errors closer to normality. 

Finally, the assumption that the errors are independent from observa- 
tion to observation may be obviously untenable. It is well known that 
crop yields on neighboring plots tend to be positively correlated, and in 
laboratory experiments observations made by the same person at about 
the same time tend to exhibit the same type of correlation. These cor- 
relations might completely vitiate tests of significance. The remedy in 
this case is the proper use of randomization, which, as it were, introduces 
independence in the assignment of treatments to the experimental units 
or in the assignment of the order in which observations are made, so that 
the errors may effectively be regarded as independent. For further dis- 


cussion of this question, see Yates (3.15), Fisher (3.3), Bartlett (3.10), 
and Welch (3.17). 
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СНАРТЕВ 4 


Completely Randomized, 
Randomized Block, 
and Latin Square Designs 


4.1 Completely Randomized Designs 


4,11 Description. The simplest type of layout is that in which treat- 
ments are allotted to the units entirely by chance. More specifically, if 
a treatment is to be applied to four units, for example, the randomization 
gives every group of four units in the experimental material an equal 
probability of receiving the treatment. In addition the units should be 
processed in random order at all subsequent stages of the experiment 
where this order is likely to affect the results. 

This design has several conveniences: 

1. Complete flexibility is allowed. Any number of treatments and 
of replicates may be used. The number of replications can be varied at 
will from treatment to treatment (though such variation is not recom- 
mended without good reason). All the available experimental material 
can be utilized—an advantage in small preliminary experiments where 
the supply of material is scarce. 

2. The statistical analysis is easy even if the numbers of replicates are 
not the same for all treatments or if the experimental errors differ from 
treatment to treatment. 

3. The method of analysis remains simple when the results from some 
anits or from whole treatments are missing or are rejected. Moreover, 
the relative loss of information due to missing data is smaller than with 
any other design. 

The principal objection to a completely randomized design is on the 
grounds of accuracy. Since the randomization is not restricted in any 
way to ensure that the units which receive one treatment are similar to 
those which receive another treatment, the whole of the variation among 
the units enters into the experimental error. For this reason the error 
can often be reduced by the use of a different design, unless the units are 
highly homogeneous or the experimenter has no information by which 
to arrange or handle the units in more homogeneous groups. 


Complete randomization seems the obvious procedure for many lab- 
95 
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oratory experiments, e.g., in physics, chemistry, or cookery, where a 
quantity of material, after thorough mixing, is divided into small sam- 
ples or batches to which the treatments are applied. On the other hand, 
these designs are seldom used in field experiments, the method of ran- 
domized blocks having been found consistently more accurate. 

One fact compensates to some extent for the higher experimental er- 
rors as compared with other designs. For a given number of treatments 
and a given number of experimental units, complete randomization pro- 
vides the maximum number of degrees of freedom for the estimation of 
error. As pointed out in section 2.31, the sensitivity of the experiment 
increases as the number of error degrees of freedom is increased. Con- 
sider, for example, an experiment with 2 treatments in 4 replicates. The 
degrees of freedom for error are 6 under complete randomization as 
against 3 if the method of pairing (section 4.2) is used. From section 
2.31 it appears that the paired experiment must produce about a 14% 
reduction in the error variance in order to offset the additional un- 
reliability in the estimate of error. This point is worth. bearing in mind 
with small experiments. 

To summarize, complete randomization may be appropriate (i) where 
the experimental material is homogeneous, (ii) where an appreciable 
fraction of the units is likely to be destroyed or to fail to respond, and 
(iii) in small experiments where the increased accuracy from alternative 
designs does not outweigh the loss of error degrees of freedom. 


4.12 Randomization. If the number of units does not exceed 16, 
tables 15.6 and 15.7 may be used. Suppose that there are 3 treatments, 
of which two have 4 replicates while the third has 8 replicates. Num- 
bers 1-16 are assigned to the units in any convenient order. A random 
permutation is drawn from table 15.7, say 


9, 13, 8, 5, 12, 1, 14, 16, 6, 7, 3, 4, 10, 11, 15, 2 


The first treatment is applied to units 9, 13, 8, 5; the second to units 12, 
1, 14, 16; and the third to the remainder. 

With more than 16 units in the experiment, numbered discs, beans, 
or a book of random numbers may be used, as described in chapter 15. 


4.13 Statistical Analysis. Table 4.1 shows some of the results of an 
experiment on the effects of applications of sulphur in reducing scab dis- 
ease of potatoes. The object in applying sulphur is to increase the acidity 
of the soil, since scab does not thrive in very acid soil. In addition to 
untreated plots which serve as а control, 3 amounts of dressing were 
compared—300, 600, and 1200 Ib. per acre. Both a fall and a spring ap- 
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plication of each amount was tested, so that in all there were 7 distinct 
treatments. The sulphur was spread by hand on the surface of the soil, 
and then disced in to a depth of about 4 inches. The quantity to be 


TABLE 4.1 FIELD PLAN AND SCAB INDICES FOR A COMPLETELY RANDOMIZED 
EXPERIMENT ON POTATOES 
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Notation: F = fall, S = spring application, 0 = control. The numbers 3, 6, 12 
are the amounts of sulphur in 100 Ib. per acre. 


Results grouped by treatments 


0 ЕЗ 83 F6 86 F12 812 
12 30 9 30 16 18 10 17 
10 18 9 7 10 24 4 7 
24 32 16 21 18 12 4 16 
29 26 4 9 18 19 5 17 
Totals 181 38 67 62 73 23 57 G = 501 
Меапв 22.6 9.5 16.8 15.5 18.2 5.8 14.2 


Analysis of variance 


Source of variation d.f. 5.8. m.s. F 
Treatments ((—1 = 6 972.8 162.0 3.61 * 
Егтог (М-д = 25 1122.9 44.9 

Total (N — 1) = 31 2095.2 


* Denotes significance at the 5% level. 


analyzed is the “scab index." This is, roughly speaking, the percentage 
of the surface area of the potato that is infected with scab. It is obtained 
by examining 100 potatoes at random from each plot, grading each po- 
tato on a scale from 0 to 100% infected, and taking the average. 
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The design was a completely randomized one, with 4 replications of 
each sulphur dressing and 8 replications of the control, which received 
extra replication in order to obtain a fairly good estimate of the natural 
infestation. Incidentally, a randomized blocks layout might have been 
superior. 

The computations are very simple. Let у denote an observation, Т; a 
treatment total, G the grand total, 7; the number of replications of the 
ith treatment, № = У) r; the total number of observations, and ¢ the 
number of treatments. 

Step 1. Find the treatment totals and the grand total. 

Step 2. The sums of squares are computed as follows. 


6% (501? 
i s m 8 = 7843.8 
Correction factor: С N 32 78. 
Total: $5 y? — € = (9? + (12)? +.--+ (4)? — 7843.8 = 2095.2 
2 1)? 2 ae .. 2 

Treatments: >> -C= cap + (8850 z + 6n — 7843.8 = 9723 
3 

Error: (Total s.s.) — (treatments s.s.) = 2095.2 — 972.3 = 1122.9 


The analysis of variance is given at the foot of table 4.1. The F- 
ratio for treatments is significant at the 5% level. From the treatment 
means it appears that all dressings had some beneficial effect, and that 
the fall application was more effective than the spring one. There is 
little or no evidence that the higher dressings were more effective than 
the lowest dressing. If the summary is conducted from this point of 
view, we should isolate and test two individual components of the treat- 
ments s.s.: (i) a component measuring the average effect of all dressings, 
(ii) a component comparing the fall and spring applications. The follow- 
ing computations are required, 

Average effect of sulphur. The total over all dressings is 320, repre- 
senting 24 plots. Since the control total, 181, represents 8 plots, the 
comparison is 

3(181) — 320 = 223 
The contribution to the sum of squares in the analysis of variance is 
(223)2/96, or 518.0. By the rules in section 3.42, the divisor, 96, is 
[(9) (8) + 24]. 


Fall versus spring application. The comparison is 
(38 + 62 + 23 — 67 — 73 — 57) = -74 


It is obviously orthogonal to the previous comparison. The contribu- 
tion to the sums of squares is (74)?/24, or 228.2. These calculations 
lead to the analysis of variance in table 4.2. 
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The remaining four components of the treatments s.s. must represent 
comparisons among the levels of sulphur. The average reduction in scab 
due to sulphur is significant at the 1% level, while the superiority of the 
fall application is also significant. Differences among the levels show no 


sign of significance. 


TABLE 4.2 SUBDIVISION OF THE TREATMENTS SUM OF SQUARES IN THE 
EXPERIMENT ON POTATO SCAB 


Source of variation d.f. 8.8. m.s. F 

"Treatments 6 972.3 162.0 3.61* 

Control vs. sulphur 1 

Fall vs. spring application 1 

Comparisons among levels 4 226.1 56.5 1.26 
Error 25 
* Denotes significance at the 5% level. 
** Denotes significance at the 1% level. 


The conclusions might be phrased as follows. “Тһе application of 
sulphur produced a significant decrease in the scab index, the averages 
being 22.6 for the untreated plots, 16.4 for plots with the spring applica- 
tion, and 10.2 for plots with the fall application. The fall application 
proved significantly better than the spring application. There was no 
indication that the higher levels of dressing were more effective than the 
lowest level.” It should be remarked that a continuation of this experi- 
ment and other experiments caused these conclusions to be modified, 
the higher levels having shown to better effect. 

A more extensive discussion of the analysis of experiments of this type 
is given by Snedecor (4.1). Note that, if certain observations are missing 
or have to be rejected, the method of computation remains exactly the 
same except for the slight complication that different treatments will 
usually have different numbers of replications. 

The practice of calculating a separate standard error for each treat- 
ment is still found in some types of experimentation. Unless the treat- 
ments have different error variances, the use of a pooled error, as given 
by the analysis of variance, is recommended since more sensitive tests of 
Significance are obtained. In cases where the experimental error appears 
to vary considerably from treatment to treatment, a test of homogeneity 
of the error variance, due to Bartlett (4.2), may be made. This test is 
also described in Snedecor (4.1, section 10.13). 

It is worth emphasizing that, even if the treatments are randomly as- 
signed to the units, the above methods of analysis do not apply when 
restrictions on the randomization are introduced at a later stage in the 
experiment. Consider a chemical experiment in which samples of liquid 
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from the same bottle are treated in four different ways. The samples, 
being mutually indistinguishable, are assigned at random to the treat- 
ments. It is, however, practicable to treat only a few samples during 
each work period, and the investigator decides to complete a single 
replication at each session. The result of this additional control is that 
the differences among replicates are eliminated from the experimental 
errors and must equally be eliminated from the estimated errors. The 
analysis should follow the method described in section 4.2. This point 
will arise frequently in experiments where many physical operations 
must be carried out. 


4.14 Standard Errors. The estimated standard error of the difference 
between two treatment means is 


Ж); (2 -) 
а s 6 
71 72 


This formula applies when a pooled error is used, s? being the pooled 
error m.s. per unit, and г; and тә the numbers of replicates for the two 


treatments. Ifr, = тә = г, the formula reduces to za * The degrees 
М T 


of freedom for t-tests are those in 52. 

Example. "The average effect of the sulphur has already been tested 
by an F-test. We will perform the same test by means of a t-test applied 
to the treatment means. Since the control has 8 replications, and the 
mean of all sulphur dressings has 24 replications, 


sa = V (44.9) ($ + =) = 2.736 


The mean scab index for the control is 22.62, and that for the dressings 
is 13.33. Hence 
22.62 — 13.33 


t = ——_____ = 
2736 3.395 


with 25 d.f. The value of ё is 11.53, in agreement with the F found in 
the analysis of variance. 


If the true error variances per unit are considered to be different for 
the two treatments, the appropriate formula is 


2 2 

81 $2 
т pees 
71 72 


where 317, 34? are the respective error m.s. per unit for the two treatments 
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Tn this case the ratio of the treatment difference to s; does not follow 
Student's distribution except in special instances. The development of 
correct significance levels for this case presents a problem that has stim- 
ulated much discussion in recent years. Various methods that have been 
suggested agree closely in practical application except when the numbers 
of degrees of freedom іп s,? and 82? are small. Fisher and Yates (4.3, 
tables V1 and V2) give tables of the significance levels of 4/54 derived 
from Fisher’s theory of fiducial probability. (Note that their sı? cor- 
responds to our s?/r,, etc. Where these tables are not readily ac- 
cessible, the following approximation is suggested. It probably errs 
slightly on the conservative side, in the sense that the value of { required 
for significance may be slightly too high. 

Let nı, по be the numbers of degrees of freedom in 812, 822, respectively. 
From the ordinary i-table record the significance levels tı, tg correspond- 
ing to nı and ng degrees of freedom, respectively. The approximate sig- 
nificance level for the ratio d/s; is 


Ра Wil, + Wel, 


ші + We 
where 


Since // always lies between the ordinary ¢ values for n; and na degrees 
of freedom, this ealeulation is needed only for those occasional cases 
Where d is close to the borderline of significance. Further, when n; = 
то = n, Р is the ordinary t value for n degrees of freedom. 

Example. This experiment is a case where one might suspect that the 
error variance would not be homogeneous. If a treatment is very suc- 
cessful in reducing scab, the scab indices for that treatment will all be 
close to zero; hence their variance must be small. With a treatment 
such as the control the scab index has a greater possible range of vari- 
ation. Consequently, we might expect the error variance for a treatment 
to depend on the mean produced by the treatment. With only 3 d.f. 
for the estimated error variance per treatment, we cannot hope to detect 
small differences in the true error variances. Perhaps the most important 
comparison is that between the variance for the control and the average 
variance for the sulphur dressings. The error m.s. will be found to be 
70.0 for the control (with 7 4.1.) and 35.2 for the dressings (with 6 Х 3 
ог 18 44). The F-ratio, 1.99, falls just short of the 10% level of sig- 
nificance. 

Consequently, the conclusions will be on & sounder basis if we ascribe 
different true error variances to the control and the dressings. For a 
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t-test of the average effect of sulphur we now take 
70.0 35.2 
sa = ./— + — = V8.75 + 1.47 = 3.197 
5 8 24 


The value of t is 9.29/3.197, or 2.906. To test this at the 1% level by the 
approximate test above we have 


т = 7; m = 18; t = 3.499; t = 2.878; ш = 8.75; we = 1.47 
Hence the value required for significance is 

(8.75) (3.499) + (1.47)(2.878) 
1 10.22 


ғ 


= 3.410 


The average effect of sulphur now falls short of the 1% significance level. 
It remains significant at the 5% level. For comparisons among the dif- 
ferent sulphur dressings we would use ordinary t-tests, where s? is taken 
as 35.2 with 18 d.f. The effect will be to enhance the significance of 
comparisons among the dressings, though the conclusions quoted pre- 
viously are not altered. 

As mentioned in section 3.9, it is usually better to handle this type of 
experiment, where the error variance is thought to be a function of the 
mean, by converting the original data to a scale on which the error vari- 


TABLE 4.3 ANALYSIS OF VARIANCE OF THE SQUARE ROOTS'OF THE SCAB INDICES 


Source of variation d.f. 8.8. m.s. F 
Treatments 6 18.22 3.04 4.19 
Control vs. sulphur 1 8.17 8.17 11.27 
Fall vs. spring application 1 4.59 4.59 6.33 
Comparisons among levels 4 5.46 1.36 1.88 
Error 25 18.12 0.725 
Error for controls 7 6.27 0.896 
Error for dressings 18 11.85 0.658 
Total 31 36.34 


ances will be more nearly constant. On the supposition that the error 
variance may be proportional to the mean, the square roots of the scab 
indices might have been analyzed. As a matter of interest this analysis 
is ame in table 4.3; square roots were recorded to only one decimal 
place. 

The error m.s. for the controls is now closer to that for the dressings, 
the F-ratio being 1.36 against 1.99 in the original analysis. Further, the 
F-ratios fo: most components of the treatments s.s. are higher than in the 
original analysis, suggesting an increase in the sensitivity of the analysis. 
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4.15a Statistical Analysis with Data Arranged in Two Classes. In 
many completely randomized experiments, the data are not measured 
on а continuous scale, but are merely classified into two classes, e.g., 
“suegess” or “failure,” “dead” or “alive.” In this event the statistical 
analysis is carried out by the standard x? tests for a 2 X t contingency 
table. Table 4.1a shows the results of an experiment at the Eversley 
Childs Sanatorium, Cebu, Philippine Islands, in which 6 drugs for the 
treatment of leprosy were compared (4.1а). At the beginning of the 
experiment and at the end of 48 weeks of treatment each patient was 
examined by a physician who did not know which drug the patient was 
receiving. The data for each treatment are the numbers of patients 
classified as “improved” or “not improved.” Initially the experiment 
was arranged in randomized blocks (section 4.21), but the grouping into 
blocks appeared to be without effect. 


TABLE 4Ла NUMBERS OF PATIENTS "IMPROVED" AND “NOT IMPROVED” 


Treatment 
A B с р Е F Total 

Imp. (а) 16 13 15 8 15 2 69 (Za;) 
Not 38 34 37 45 34 50 238 
Total 54 47 52 53 49 52 307 
Proportion 
improved 

(pi) 0.2963 0.2766 0.2885 0.1509 0.3061 0.0385 | 0.22476 (p) 


The proportion p; of patients improved under each treatment is shown 
at the foot of the table. These are the figures by which the clinical effec- 
tiveness of a treatment is judged. 

The x? test for this 2 X 6 contingency table is the analogue of the 
F-test with continuous data: it tests the null hypothesis that there are 
no differences in the true proportions improved under the different treat- 
ments. The value of x? is often computed from the observed (0) and 
the expected (E) numbers in each cell, using the formula 


x = È (0 – EY/E 


Since the proportions improved р; must be calculated in order to look 
at the treatment effects, the equivalent formula of Brandt and Snedecor 
is more expeditious (4.1). 


2 Dap; — BX (a) 
а) 
24 
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where a; = number improved under the 7th treatment. 
p = $27 = overall proportion improved. 


а=1- р. 
For these data, 
2. (16)(0.2968) + (13)(0.2766) + + + (2)(0.0885) — (69)(0.22476) 
а (0.22476) (0.77524) 
= 1740 (5 df.) 


This is significant at the 5% level. With this method the p; must be 
carried to extra decimal places to avoid loss of accuracy in х2. 

Note that if all treatments have the same number r of replicates, the 
Brandt-Snedecor formula simplifies to 


х = È (а – @)?/rpg 


The next step is to compare the individual treatments. This is done 
by breaking down the contingency table into smaller tables. Treatments 
A and В were two types of sulfone drug; in treatments С and Е the 
active ingredient was dihydrostreptomycin sulphate; treatment D was 
the combination of streptomycin plus sulfone; and treatment F was a 
control that did not contain any active agent against the leprosy bacillus. 
A rational series of comparisons is as follows: 


Active drugs vs. Control (А +++ Е) ув. Е 
Sulfones vs. Strep. уз. Combined (А, В) уз. (С, Е) vs. D 
Between Sulfones A vs. B 

Between Streps. C vs. Е 


The contingency tables required for this breakdown appear in table 
4.28. The value of x? is computed separately for each table and is shown 
under the table. The correction for continuity should be used in all 
2 X 2 tables, but not in larger tables. Note that these x? values will 
not add up to the total x^ of 17.40, even though all 5 d.f. have been 
accounted for. 

The results are that the combined'drugs A --- E show a significantly 
higher percentage improvement than the control (x? = 11.21, 1 44). 
No differences in effectiveness among the drugs can be detected. The 
latter result is fairly obvious from inspection of the original table 4.1a, 
erg complete breakdown is given here primarily to exhibit the 
method. 
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TABLE 4.2a BREAKDOWN OF CONTINGENCY TABLE FOR DETAILED COMPARISONS 


Active drugs vs. Control Sulfones vs. Streps. vs. Combined 
A.E F | Total A,B СЕ .D | Total 
Imp. 67 2 69 Imp. 29 30 8 67 
Not 188 50 238 Not 72 71 45 188 
Total | 255 52 307 Total| 101 101 53 255 
xe? = 11.21 (144) xi-434 (244) 
Between Sulfones Between Streps. 
A B | Total с Е | Total 
Imp. 16 13 29 Imp. 15 15 30 
Not 38 34 72 Not 37 34 71 
Total | 54 47 101 Total | 52 49 101 
xe? = 0.00 (1 df.) x’ = 0.00 (144) 


If the treatments represent increasing amounts of a substance, the 
primary interest usually lies in the curve of relation between the р; 
and the corresponding amounts 2;. When this relation is likely to be 
approximately linear, we may wish to subdivide x? into a single d.f. 
representing the linear regression and а remainder representing devia- 
tions from the regression. The same device is useful when the treat- 
ments are ordered (e.g. mild, moderate, and intense background noise 
under which some task has to be performed) if it is reasonable to assign 
scores x; to the levels of treatment. 

For the ith treatment, let p; — a;/n;. To compute the single d.f. 
x? for regression, find the quantities 


T (Zaj) (nz) 
алану. 
(Хт)? 
“ pe — 
D = Уп; (n) 


These are, respectively, the numerator and denominator of the regres- 
sion coefficient b of p; on z;. Then the x? for linear regression is 


with 1 d.f., where р = (Za;)/(Zn,). If the true relation is close to linear, 
xi? may reveal a significant relationship when the total х? does not. 
The remainder, x? — x;?, with (t — 2) d.f., serves as a test of deviations 
from linearity, where ¢ is the number of treatments. 
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42 Single Grouping: Randomized Blocks 


421 Description. The essence of this design is that the experimental 
material is divided into groups, each of which constitutes a single trial 
or replication. At all stages of the experiment the object is to keep the 
experimental errors within each group as small as is practicable. Thus, 
when the units are assigned to the successive groups, all units which go 
in the same group should be closely comparable. Similarly, during the 
course of the experiment, a uniform technique should be employed for 
all units in the same group. Any changes in technique or in other con- 
ditions that may affect the results should be made between groups. 

This division into replications need be recognized only at those stages 
in the conduct of the experiment where the division may help to reduce 
experimental errors. In agricultural field experiments the division is 
made at the start when the plots are marked out in the field. Since 
neighboring plots are known to be more alike in fertility than plots some 
distance removed, each replicate consists of a compact group of plots 
and is made approximately square in shape if feasible. Cultivations 
designed to keep the land clean of weeds will usually be carried out with- 
out regard to the replications, because it is not believed that results are 
affected by the order in which plots are cultivated. Similarly, the plots 
will generally be harvested in whatever order is most convenient. If, 
however, harvesting must be spread over a number of days, it is well to 
harvest the plots replication by replication, in case rainfall or other 
factors should produce changes in the weight of the сгор from day to 
day. In other types of experimentation no real distinction might be 
made between replicates until relatively late in the experiment. 

The principal advantages of randomized blocks are as follows. 

1. By means of the grouping, more accurate results are usually ob- 
tained than with completely randomized designs. 

2. Any number of treatments and any number of replicates may be 
included. With the design as described above, each treatment will have 
the same number of replicates. If extra replication is desired for some 
treatments, each of these may be applied to two units within every 
group. This device provides twice the standard number of replicates 
for the treatments in question, at the expense of some increase in the size 
of the group, which now contains more than a single replication. Sim- 
ilarly a treatment may be applied three or four times in a group. 

3. The statistical analysis is straightforward. Mishaps which neces- 
sitate the omission of a complete group or of the entire data from one от 
more treatments do not introduce any complication in the analysis. 
When data from some individual units are lacking, the “missing-plot’ 
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technique developed by Yates enables the available results to be fully 
utilized. Some extra computational labor is, however, involved, and if 
the gaps are numerous the design is less convenient in this respect than 
complete randomization. 

4. If the experimental error variance is larger for some treatments 
than for others, an unbiased error for testing any specific combination 
of the treatment means can still be obtained. 

No design is more frequently used than randomized blocks. Certainly 
if a satisfactory degree of precision is reached, there is little need to search 
for alternative designs. 

It is worth noting that the replication means provide unbiased com- 
parisons of the differences among replieates. Occasionally, these dif- 
ferences measure some property of the experimental material that is of 
interest. The variance-ratio test of the replications against the error 
m.s. requires some care in its interpretation. In а greenhouse experi- 
ment, for instance, a different type of soil might be used in each replica- 
tion. Significant differences among replications might be due either to 
differences in soil type or to differences in the positions of the replications 
within the greenhouse. 


4.22 Randomization. When the units have been grouped, the treat- 
ments are assigned at random to the units within each group. А new 
randomization is made for every group. Unless the number of treat- 
ments exceeds 16, tables 15.6 and 15.7 are convenient for this operation. 


4.23 Statistical Analysis. The example comes from an experiment 
carried out by the North Carolina Agricultural Experiment Station at 
Rocky Mount, М. C., in 1944. The experiment tested the effects of 5 
levels of application of potash, supplying respectively 36, 54, 72, 108, 
and 144 Ib. K3O per acre, on the yield and properties of cotton. The 
measure chosen for analysis is the Pressley strength index. This is 
found by measuring the breaking strength of a bundle of fibers of a given 
cross-sectional area. A single sample of cotton was taken from each plot, 
and 4 determinations were made on each sample. The figures in table 
4.4 are the means of these 4 samples.* The experiment was arranged 
in 3 randomized blocks of 5 plots each. 

To make the computing instructions more general we suppose that 
there are ¢ treatments and г replicates, and that y denotes a typical ob- 
servation. 

* Since the machine which measures the index is calibrated in arbitrary units, no 


dimensions are ascribed to the data in table 4.4. Тһе index ean be converted ap- 
proximately into pounds per square inch by means of a regression formula, 
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Step 1. Find the treatment totals (Т), the replicate totals (В), and 
the grand total (б). 


TABLE 4.4 STRENGTH INDEX OF COTTON IN A RANDOMIZED BLOCKS EXPERIMENT 


Replications 
Treatments Coded 
Pounds КО рег acre 1 2 3 Totals dressing 
36 7.62 8.00 7.93 23.55 13 
54 8.14 8.15 7.87 24.16 8 
72 7.16 7.73 7.74 23.23 3 
108 7.17 7.57 7.80 22.54 -7 
144 7.46 7.68 7.21 22.35 —17 
Totals 38.15 39.13 38.55 115.83 
Analysis of variance 
Source of variation d.f. 8.8. m.s. F 
Replications r-1) =2 0.0971 
Treatments (- 1) = 4 0.7324 0.1831 4.19* 
Error @—1)@—1) = 8 0.3495 0.0437 
Total n=l =14 1.1790 
Subdivision of the treatments 8.8. 
d.f. 8.8. ms. Е 
Linear response 1 0.5663 0.5663 12.96** 
Deviations 3 0.1661 0.0554 1.27 
Step 2. The sums of squares are obtained as follows. 
2 
Correction factor: С = £ £689) = 894.4393 
tr 15 
Total: У y? — C = (7.62)? + (8.14)? +- - -+ (7.21)? — 894.4393 = 1.1790 
tui R? .15)? .13)? 2 
Replications: У) i кыбы Ан а-э + (88.55); ' 894.4393 = 0.0971 
Treatments: 
тё 23.55)? + (24.16)? +... 2 
DBE SENG) E PEE А FEO 504.4393 = 0.7324 


Error: (total s.s.) — (replications 3.3.) — (treatments 8.8.) 
= 1.1790 — 0.0971 — 0.7324 = 0.3495 


The F-ratio for treatments (table 4.4) is 0.1831/0.0437, or 4.19, which 
is significant at the 5% level with 4 and 8 df. The treatment totals 
suggest that the strength decreases with increasing applications of 
potash, though there is a hint of a maximum in strength for the 54-Ib. 
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application. Accordingly it is worth while to examine the shape of the 
response curve. We will first fit a linear regression on the amount of 
dressing. Note that successive increments of dressing are not equal, the 
last two increments being twice as large as the first two. This is a 
fairly common practice in cases where the effectiveness of an increase in 
dressing is expected to be smaller at the higher levels of application. 

Although the regression can be calculated by the standard formula, 
a simple coding of the dressings lightens the work. If we place. the 
lowest dressing (36 Ib.) at zero and take 18 Ib. as 1 unit оп the scale, the 
dressings may be coded as 0, 1, 2, 4, and 6, respectively. A further de- 
vice is to subtract a common amount from these values so that their 
mean is zero. Since the mean of the coded dressings is 13%, this is the 
amount that must be subtracted. To avoid fractions, we multiply all 
values by 5 and subtract 13 from each product. The resulting coded 
dressings are shown in table 4.4; the signs have been changed so that the 
regression coefficient will be positive. 

The regression coefficient is now obtained as 


p _ 03)028.55) + (8) (24.16) + (3)03.23) — (7)(22.54) — (17)(22.35) 
(3)((13)? + (8)? + (8)? + (7)? + 077 


The factor (3) in the denominator is inserted to convert the treatment 
totals to means. The regression coefficient represents the average de- 
crease in strength for a unit increase on the coded scale. Since 5 units 
on the coded scale correspond to 18 Ib. K20, the decrease in strength for 
each additional 18 Ib. КО is estimated as 0.090. 

The contribution of the regression to the sum of squares in the analysis 
of variance is (by rule 1, section 3.42) 


(31.39)? 
1740 


as shown in the analysis of variance. The contribution from the linear 
regression is significant at the 1% level. Since the mean square for 
deviations from the regression, 0.0554, is only slightly above the error 
m.s., there seems no point in investigating a quadratic regression, though 
the reader may care to verify that its contribution is small. 

The conclusion from the data is that increased dressings of potash 
produce a weaker fiber, the strength index declining by 0.090 for each 
18 Ib.-increment in КО. Ав an exercise we will obtain confidence limits 
for this rate of decline. We first require the estimated standard error for 


= 0.5663 
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the 0.090 figure. This сап be found by expressing the figure as a linear 
function of the treatment means jj; and applying rule 5a of section 3.51. 
We have 

0.090 = 590137: + 892 + 373 — 774 — 1795] 
so that the estimated standard error is 

V 0.0437 5 
Yor =—— — V3)? + (8)? +--+ 17)? 

bh 278 580 (13)? + (8) ( 
5 шаа 0.0251 
NND re ate 

Alternatively, this value could be derived by noting that the square root 
of the F-value for the linear regression, i.e., М 12.96, or 3.6, is the ratio 
of 0.090 to its estimated standard error. Thus the latter must be 
0.09/3.6, or 0.0250. Finally, for the 80% confidence limits, we have 
(0.090 + 0.025 X 1.397), or (0.090 + 0.035), where 1.397 is the value of 
1 for а probability 0.20 and 8 d.f. 


vr 


4.24 Standard Errors. The estimated standard error of the difference 
between two treatment means is 


ps? 
8 = T 
r 


If some treatments receive extra replication, the general formula is 


sa = 82 (= + 5) 
тт 12 


In experiments where the error appears to be heterogeneous, a separate 
error may be obtained for any pair or for any group of treatments. For 
this a new randomized blocks analysis is carried out on those treatments 
which belong to the group under consideration. 


4.25 Missing Data. The method of analysis when part of the data is 
missing is described by Yates (4.4). With a single missing unit, the first 
step is to calculate a value for the unit by means of the formula 
_ B+ -G 

(т — 1)@— 1) 
where B is the total of the remaining units in the block where the missing 


unit appears, Т is the total of the yields of this treatment іп the other 
blocks, and G is the grand total; r and t are the numbers of replicates and 


y (4.1) 
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treatments respectively. The analysis of variance is then carried out as 
usual except that 1 d.f. is subtracted from the total s.s. and from the 
error 8.8. 

The standard error of the difference between the mean of the treat- 
ment with a missing value and the mean of any other treatment is 


^1 деч бае 
, [тали ү 


When there are several missing values, for units а, b, c, d,--+, we first 
guess values by inspection for all units except a. Formula (4.1) is then 
used to find an approximation for a. With this approximation and the 
values previously assumed for c, d,» - -, we again use formula (4.1) to in- 
sert an approximation for b. After a complete cycle of these operations, 
а Second approximation is found for a and so on until the new approxi- 
mations are not materially different from those found previously. The 
analysis of variance is then completed; for each missing unit, 1 d.f. is 
subtracted from the total and error s.s. 

Suppose that in the previous example two observations had been 
missing and that the data had appeared as follows. 


Replications 
— 

Pounds K3O 1 2 3 Totals 
36 a 8.00 7.93 15.93 

54 8.14 8.15 7.87 24.16 

72 7.76 b 7.74 15.50 

108 7.17 7.57 7.80 22.54 

144 7.46 7.68 7.21 22.35 


Totals 30.53 31.40 38.55 100.48 


Since differences between replications are not pronounced, we might 
take as a trial value for a the mean of 8.00 and 7.93, or 7.96. For 
estimating b;, the first trial value of b, by formula (4.1) we now have 

B = 31.40; T = 15.50; G = 100.48 + 7.96 = 108.44 
so that 

y, _ (8008140) + (6)(15.50) - 108.44 

: 8 
For estimating аҙ, the second trial value of а, we now have 

B = 30.53; T = 15.93; G = 100.48 + 7.91 = 108.39 
so that 

e (3)(30.53) + (5)(15.93) — 108.39 


8 


= 7.91 


= 7.86 
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Taking аз as 7.86, we find that the next trial value of b is 7.92, This is so 
close to the previous value that we may stop. Thus a = 7.86, b = 7.92. 

The analysis of variance is now computed with these values inserted. 
There will be 12 d.f. in the total s.s., and 6 d.f. in the error в.з. The 
error m.s. is 0.0491. 

To obtain a standard error for the comparison of two treatments, A 
and B, we assign an “effective” number of replicates to each treatment. 
Any replicate of treatment A is counted as 1 when both A and B are 
present in the replicate, as (¢ — 2)/(¢ — 1) when A is present and B is 
not, where ¢ is the total number of treatments in the experiment, and 
as 0 when A is missing. The same rule is applied in scoring В. Sup- 
pose that in the example we are comparing the means of the 36- and 
72-Ъ. dressings. Since ¢ = 5, the effective replication for each mean 
is scored as 1%. The standard error of their difference is taken as 
Мз?(% + +), or М 1.1438. If we are comparing the mean of the 36-1. 
dressing with that of the 54-Ъ. dressing, which has no missing values, 
the score for the first mean is 2 and that for the second 234, so that the 
standard error of their difference is V s?($ + тг), огу 0.94452. 

This useful approximation, due to Taylor (4.17a), is a refinement of 
an earlier rule given by Yates. The exact formulae are laborious to 
compute. In the example the correct values for the standard errors of 
the two differences considered above are 1.069s and 0.937s, as compared 
with the approximate values of 1.068s and 0.972s, respectively. 

As a result of the disturbance introduced by the missing units, the 
treatments m.s. is slightly too large; however, the variance-ratio test is 
unlikely to be much in error unless a substantial proportion of the units 
are missing. Yates (4.4) gives the method for obtaining an exact test. 

4.26 Estimation of Efficiency. If E, and E, аге the block and error 
m.s. and ль, ту, and n, the block, treatment, and error degrees of freedom, 


M пЕь + (м + ne) Ee 
ny + M + те 


ів an estimate of the error variance of a completely randomized design 
with the same experimental material. Comparison of Е... and Ee, 
taking account of the change in numbers of error degrees of freedom 
(section 2.31), provides an estimate of the increase in accuracy which 
results from the grouping into replicates. 

The result in (4.3) may be proved in various ways. Since there is no 
complete discussion of results of this type in the literature, one method 
of proof will be sketched; the details, which are a matter of algebraic 
manipulation, will not be given completely. This proof uses the proper- 


Ес. (4.3) 
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ties of randomization. It is not the easiest proof, but requires very few 
assumptions. 

Let the experiment have ¢ treatments and r replications, and let e; 
be the experimental error of the observation on the ith unit, and т; the 
effect of the jth treatment. No assumption is made about the nature 
or distribution of the errors. For this reason it is not necessary to 
introduce any specific symbol for the effect of the replication, since 
any type of effect can be represented by appropriate choice of the e’s. 
The treatment effect and the error are assumed to be additive; that is, 
if the randomization happens to put the jth treatment on the ith unit, 
their joint effect is (т; + е;). Throughout the randomization the e; are 
regarded as a set of fixed numbers, each associated with a specific unit. 

Without loss of generality, we may assume that the total of all ob- 
servations is zero, and that the totals of the e’s and the 7’s are both zero. 


Let я t 
S=} ef; T= Уут 
is іт 
Further, for а] possible randomizations of the randomized blocks design 
the replications will of course remain the same; consequently, for any 
given batch of data the replication totals remain unchanged. Let R de- 
note the sum of squares of these totals, divided by 1. 

The main part of the proof consists in working out the average values 
of the mean squares in the analysis of variance, taken over all possible 
randomizations of each type of design. The results come out as shown 
in table 4.5. 


TABLE 4.5 AVERAGE VALUES OF MEAN SQUARES TAKEN OVER THE 
RANDOMIZATION SETS 


Completely randomized 
d.f. ж 
TT 8 
Treatments (—1 4-1) É (rt — 1) 
S 
Error t(r — 1) (rt — 1) 
Randomized blocks 
ав шз. 
ҒҒ Е 
Replications (г scu (т — 1) 
rT 8 Е = 
Treatments 4-0 ет + "(-1) +@—1) 
8 Е 
Егтог є—1)@—1) 


-D @—1 
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This analysis leads at once to the result. Since 
my—(r—1; т-т-т(- 1); 


it is seen from table 4.5 that for randomized blocks the average value 
of nEs is В, while that of (n, + n-)H, is (S — R). It follows that the 
average value of 


пЕь + (и + п) Е-8-Е 8 


ть + т + Ne (rt — 1) (rt — 1) 
But this is the average of the error m.s. for a completely randomized 
design on the same data. 
As an illustration of the details, consider the treatments m.s. for ran- 
domized blocks, which is probably the hardest term. The square of the 
first treatment total is 


Ty? = (т + а + e d ey 


When we average over all possible randomizations, there is no contribu- 
tion from terms in тез, since the treatment appears equally often on all 
units, and the total of the e’s is zero. The average value of the contri- 
bution from the г terms in е? is the sum of the squares of all the e’s, 
divided by t, or 5/4, since the treatment appears on any specific unit 
in a fraction 1/t of all randomizations. 

The contribution from terms in 6;6; may be written 


ei(ez + es best er) + en + ев bb е) 4 

Tee + е +... er) (44) 
The randomization is restricted so that every e comes from a different 
replication. If the subscript denotes the replicate, the mean value of 


exe; is R,R,/t, where В; is the replication total of the e's. Consequently, 
the mean value of (4.4) is 


1 
g as + В +--- В) + RS + № +. RR) +... 

1 В 
TR + В +++ В, )] = = а Un RP ee EJ) = —-—- 


since the total of the R; is zero, and since R will be recognized аз the sum 
of squares of the R;, divided by 2. Hence 


If this expression is summed for all ¢ treatments and divided by r(t — 1) 
to give the treatments m.s., the result in table 4.5 follows. 
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4.27а Statistical Analysis with Data Arranged in Two Classes. The 
method of analysis when the results consist of data classified into two 
classes will be presented first for an experiment with two treatments. 
Consider an experiment in which the objective was to compare the abili- 
ties of two culture media to detect the presence of Salmonella organisms 
in specimens of feces. Samples from each of 222 specimens were grown 
on each medium. For each sample a record was made as to whether 
presence (+) or absence (—) of the organisms was found. The experi- 
ment was thus a randomized blocks experiment with 222 replications, 
the pair of samples from each specimen constituting a block or replica- 
tion, Real differences between blocks would be expected, because many 
specimens may contain no Salmonella, in which case both media should 
give —, while other specimens may have many organisms present, in 
which case both media are likely to give + even if one of them is much 
poorer than the other. 

At first sight it seems natural to present the data in the usual 2 X 2 
table for a test of the difference between two proportions, as in table 4.3а. 


TABLE 43a Момвевз or (+) AND (—) FOR TWO GROWTH MEDIA 
Medium 
A B 


+ 12 7 19 
- 210 215 425 


Total| 222 222 444 


The figures to be compared аге the proportions of positives, 12/222 
and 7/222, for the two media. However, because of the arrangement 
of the experiment in randomized blocks, these two proportions are posi- 
tively correlated, so that the ordinary x^ test for the difference between 
two proportions is invalid (except that this test may be an adequate 
approximation when block effects are small). 

The correct test cannot be made from table 4.3a, since this table does 
not show the results in individual blocks. Presentation of results for 
the 222 blocks can be considerably condensed, because in any block 
only four combinations of results for the two media are possible: ++; 
+—; —+; or ——. It is sufficient to record the numbers of blocks 
which contain each of these four possibilities. The data obtained for 
this experiment are shown in table 4.4a (i). The right half of the table 
Gi) gives an equivalent presentation that is often used. Note that in 
the right half of the table the proportions 12/222, 7/222 now lie in the 


margins. 
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TABLE 44а CONDENSED PRESENTATION OF RESULTS FOR INDIVIDUAL BLOCKS 


(i) Gi) 

Combination | No. of 

A B | blocks 
Bp ees 
pnto» ШШ; шшш 
us B 5 + 7 0 7 
= + 0 - 5 210 215 
hore 010 DU A айыы c= кін 
12 210 222 
То make the test, ignore all blocks showing ++ or — —. These 


blocks give no clue as to which treatment is superior and, as we have 
indicated, their results may be due to the nature of the blocks rather 
than to the performances of the treatments. 

This leaves 5 blocks. If A and B are equally effective, the number 
of blocks in which A is + and B is — should equal that in which A 
is — and B is +, apart from sampling errors. Consequently we test the 
observed partition, 5:0, against an expectation of 214 in each cell by the 
usual x? test for a single binomial. If a is the number of (4--) blocks 
and b the number of (— 4-) blocks, the value of x?, corrected for con- 
tinuity, simplifies to 
M (a—b—1»? 3 (4.0)? 

a+b 5 

In the numerator, the absolute value of (а — Б) is reduced by 1 before 
squaring. The probability is 0.074. 

The exact significance probability can be calculated from the binomial 
series (1% + 14)”, as follows. The probability that A should receive all 
5 + signs is 1/2° or 1/32. For a two-tailed test, we double this value, 
giving a probability of 0.062. The difference in the proportions of + 
results for the two media is not quite significant. This exact test, known 
as the sign test, has numerous applications in other situations (4.3a). 

With more than 2 treatments, the data are arranged as on the left of 
table 4.5a, which shows results of a comparison of 4 growth media for 
diphtheria ‘bacilli from throat washings of 69 subjects. Four blocks 
(subjects) gave + on all 4 media; two gave + on A, B, D and — on 
C, and so on. For a x? test of the hypothesis that there are no differ- 
ences among treatments, compute the following quantities: 


Т; = total number of + in ith treatment 
B; = total number of + in jth block 


= 3.2 (144) 
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TABLE 4.5a PRESENTATION OF RESULTS WITH 4 TREATMENTS 


Medium 
No. of Freq. dist. of no. of 
A B с D_ | blocks + in blocks 
+ + "E rp: 4 Bj fi 
+ or = + 2 4 4 
ізі + + + Б] 3 5 
= а 1 2 1 
- = - = 59 0 59 
Total + 6 10 T. dU 69 


А frequency distribution of the B; (made from the right side of table 
4.52) is shown on the right of this table. From this, compute 


У лв; = 33 = Ут, fB? = 113 
Then, for ¢ treatments, 


, 0-10) 507 72 (9e | 
t È (GB) — У (GB?) (4)(83) — (113) 


with (t — 1) or 3 d.f. The significance probability is about 0.045. For 
more detailed comparisons among a subgroup of the treatments, make а 
separate table containing the results for this subgroup alone, and apply 
the same test. If there are only two treatments, this x? test reduces to 
the one given earlier in this section (without the correction for continuity) 
(4.42). 


43 Double Grouping: Latin Squares 


4.31 Description. In the latin square the treatments are grouped into 
replieates in two different ways. Examples of latin squares are shown 
for different numbers of treatments in plan 4.1 (p. 145). Note that 
every row and every column of any square is a complete replication. 
The effect of the double grouping is to eliminate from the errors all dif- 
ferences among rows and equally all differences among columns. Thus 
the latin square provides more opportunity than randomized blocks for 
the reduction of errors by skillful planning. 

The experimental material should be arranged and the experiment 
conducted so that the differences among rows and columns represent 
major sources of variation. Some examples of the uses of latin squares 
in various fields of research may indicate the utility of the design. In 
field experiments the plots are usually laid out in a square formation, 
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so that soil fertility and other variations in two directions are controlled. 
Variations along and across the greenhouse bench may be similarly 
handled in greenhouse experiments. Occasionally the latin square is 
advantageous even when the plots form a continuous line. In this case 
the rows may be compact blocks of land while the columns specify the 
order within each block. If the yield gradient is suspected to be in the 
same direction all along the line, blocks and order in blocks together re- 
move the effects of the gradient more thoroughly than a single control. 
An example of this type is shown below. 


TABLE 4.6 A LATIN SQUARE WITH THE PLOTS IN ONE CONTINUOUS LINE 


Seven treatments A, B, C, D, E, F, G 
(Experimental limitations force the plot units to go at right angles to gradient.) 


Rep 1 п ш ту v м уп 

ИШИНИН 

Main and Tippett (4.5) describe how 4 Х 4 latin squares were em- 
ployed in experiments on the weaving of cotton cloth. The purpose of a 
series of experiments was to investigate the effect of the sizing treatment 
applied to the warp. The criterion was the number of breaks in the warp 
during weaving. Four warps, each with a different sizing treatment, 
were woven simultaneously on 4 different looms, which could be super- 
vised by a single weaver. Then each warp was moved to a different loom 
of the set so that after 4 periods every sizing treatment had been tested 
on all 4 looms. 


If A, B, C, D represent the 4 warps, the latin square used was as 
follows. 


Looms 
Periods 1 2 3 4 
I A D B с 
П р с А В 
ш с В D A 
IV B A с р 


This arrangement eliminates constant differences among the looms, 
which were found to be large, and also differences among the 4 periods 
of weaving. No “period” differences were apparent—a result which 
might be anticipated, the authors suggest, because humidity and warp 
tension were controlled throughout the experiment. 

Frequently, particularly in industry, an experiment requires a series 
of operations each of which may introduce variability into the final re- 
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sults. In such cases the latin square may be useful in a preliminary in- 
vestigation of the sources of variation. For example, in the preparation 
of an explosive mixture used in primers, variation may occur either in 
the mixing of the ingredients of the explosive or in the process of charg- 
ing. One experiment of this type involved 4 mixing-blending teams and 
4 charging operators. On each day, the product of each team was sent 
to a different charging operator, the arrangement being changed daily 
according to the following 4 X 4 latin square (letters W, X, Y, and Z 
represent the mixing-blending teams). 


Charging operators 
1 2 3 4 
Monday т 2 х yY 
Tuesday X и Y 2 
Wednesday Ү х 2 и 
‘Thursday 7 Y Ww x 


The latin square analysis of variance enables us to isolate consistent dif- 
ferences amongst the teams and consistent differences amongst the 
chargers, as well as day-to-day variations. 

Аз another example of this type, McGehee and Gardner (4.2a) em- 
ployed a 5 X 5 latin square to measure the effect of factory music on 
the production of women workers in the operation known as “setting” 
in rug manufacturing. Four distinct music programs (A, B, C, D) were 
compared with no music (Z). A single replication occupied the 5 work- 
ing days of the week, a different program being tested on each day. The 
latin square was used to rotate the programs from week to week, so that 
over a 5-week period each program appeared once on any specific work- 
ing day of the week, as shown below. 


Week Mon. Tues. Wed. Thurs. Fri. 


CUR OO tO 
oou» 
сыьсш 
whia 
ьс шщ 
auvi 


Consistent differences in production rates between weeks and between 
days of the week are eliminated from the errors of the comparisons among 
programs. The example illustrates how a valid experiment can be con- 
ducted under ordinary factory conditions on a question on which, as the 
authors point out, little data are available although many pronounce- 
ments about the effects of music have been made. 
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Аз а more complex example, а 12 X 12 latin square was used by Chen, 
Bliss, and Robbins (4.6) to estimate the toxicities of 12 poisons when 
applied to cats. Each drug was injected into the femoral vein of the 
cat at 1 с.с. per minute until the cat died, the amount required being 
recorded. Since an observer could administer only two drugs at one 
time, three observers participated, each treating two cats in the morning 
and two in the afternoon. The rows of the latin square eliminated sys- 
tematic differences between observers and between the morning and 
afternoon injections. The columns represented days. All three major 
variables—day, time of day, and observer—would have inflated the 
experimental error if left uncontrolled. The plan is shown below (let- 
ters represent drugs). 


Day 
Time | Observer 

1|2|814|5|6|7|8 10 | 11 | 12 
10:30 A.M. I I|J|B|L|H|IG|F|K|D|E|A|C 
KG.J|H|I|B|L|IC|E|PF|D|A 
II BI|L|G|C|D|J|K|E|H|A|F I 
ЕЕ а КАБ Св | E 
ш С|КЦА|ВІР |І |І |рІСІНІУІЕ 
ЕНКЕС С БВ АЕ Г 
2:30 р.м. 1 J|C|E|KI|A|I|H|F|B|G|L|D 
D|IF|I|A|L|E|C|G|J|B|H|K 
II Ав бр Ева нтк г 
BBL ОГ ВТ |ж |6] „ 
ІП er FLUE HN XLlA|LLOG E|B 
ЖАСАЛУЫН В ng S ETJIAXIG 


4.32 Number of Replications. The chief restriction on the utility of 
the latin square is that the number of replicates equals the number of 
treatments; if the latter is considerable, the number of replications re- 
quired becomes impractical. Squares larger than 12 x 12 are seldom 
used, while the most common range is from the 5 X 5 to the 8 X 8 
square. Latin squares also suffer to some extent from the same dis- 
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advantage as randomized blocks in that the experimental error per unit 
is likely to increase with the size of square. 

The small squares provide only a few degrees of freedom for the 
estimation of error—none with the 2 X 2, two with the 3 X 3, and six 
with the 4 X 4. This fact precludes the use of single 2 X 2 squares, 
while the 3 X 3 and 4 X 4 squares must produce a substantial reduction 
in error over randomized blocks or complete randomization to counter- 
balance the loss of degrees of freedom. More than one square may, 
however, be included in the same experiment. Three 3 Х 3 squares 
(9 replicates) furnish 10 error d.f., while two 4 X 4 squares give 15 d.f., 
provided that in each case the “squares X treatments” interaction can 
be pooled with error. A considerable number of 2 X 2 squares would be 
required, since the error degrees of freedom are 1 less than the number of 
squares. 


4.33 Randomization. A complete representation of the squares from 
4 X 4 to 6 X 6 and sample squares up to the 12 X 12 is given by Fisher 
and Yates (4.3). For squares up to the 6 X 6, the randomization pro- 
cedure given in this reference selects (with a minimum of labor) a square 
at random from all latin squares of a given size. For a discussion of the 
theoretical basis of the randomization, see Yates (4.7). 

Examples of latin squares are shown in plan 4.1. The method of 
randomization for plan 4.1 is as follows. 

8 X8. Arrange the columns at random and the last 2 rows at random. 

4 X4. Select at random one of the 4 squares. Arrange at random all 
columns and the last 3 rows. It is equally good, though not strictly 
necessary, to randomize all rows and columns. 

5 X 5 and higher squares. Arrange all rows, columns, and treatments 
independently at random. 

For 3 X 3 and 4 X 4 squares, this procedure selects one square at 
random from all possible squares. For 5 X 5 and larger squares, some 
types of squares have no chance of being selected if plan 4.1 is used. 
Unless latin squares are used very frequently, however, the randomiza- 
tion sets are sufficiently large for experimental plans. 


4.34 Statistical Analysis. Several experiments have demonstrated 
that people find it difficult to select, by personal judgment, unbiased 
samples even from relatively small populations that can be thoroughly 
inspected before selection. In this experiment, each population con- 
sisted of a small area of wheat, containing about 80 shoots, the shoots 
being slightly over 2 feet high. There were 12 samplers, all experienced 
in studying the growth of wheat. Each sampler inspected each area and 
measured the heights of 8 shoots whose heights were to give a represent- 
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ative sample of the shoot heights in the area. The quantity that will be 
analyzed is the difference between the mean height of the 8 selected 
shoots and the true mean height in the corresponding area, in other 
words the sampler’s error. 

The samplers were divided into two groups of six, of which we will 
consider only the first group. The samplers represent the experimental 
treatments. There were 6 areas, each sampled by all 6 samplers in the 
group. These areas, which serve as the replications, may be taken as the 
columns of the 6 X 6 latin square. The rows of the square prescribed 
the order in which each man sampled the 6 areas (see table 4.7). This 


TABLE 4.7 SAMPLER'S ERRORS IN SHOOT HEIGHTS (cENTIMETERS) 
6 X 6 LATIN SQUARE 


Areas 
Order "Totals 


I Р+ 3.5 | B+ 42 | A+6.7 | D+6.6 С--41 | Е-+ 3.8 +28.9 
Ir B+ 8.9 | F+ 1.9 | 2+5.8 | А+4.5 E+2.4 | C+5.8 +29.3 
ш C+ 9.6 | E+ 3.7 | Р-2Л | B+3.7 D+6.0 | A+7.0 +27.3 
ТУ) D+10.5 | C+10.2 | B+4.6 | Е+3.7 А-5.1 | F+3.8 +37.9 
ү E+ 31 | A+ 72 | С+40 | F—33 B+3.5 | D+5.0 +19.5 
VI A+ 5.9 | D+ 7.6 | E—0.7 | С+30 F--40 | B+8.6 +28.4 


Totals +15 +348 +17.7 4182 4251 4340 41713-6 


Totals for samplers 
A B с р Е F 
+364 +33.5 +36.7 -+41.5 +160 +72 


feature served two purposes, It helped to prevent the men from getting 
in each other’s way, since no two men worked on the same area at the 
same time, and it permitted an examination of the effects of the order of 
sampling on the errors, 

Since only 3 of the 36 errors in table 4.7 are negative, there is evidence 
of в rather consistent tendency towards overestimation of the shoot 
heights. For simplicity, the observations will be referred to as measures 
of bias, though of course they contain a component due to chance 
fluctuations. There is also an indication, on casual inspection of the 
table, that the bias varies from one sampler to another. This point will 
be tested in the analysis of variance. The computations are as follows. 
First find the totals for each row, column, and latin letter and the grand 
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total, G, shown in table 4.7. The following sums of squares are then 
calculated. 
G (тіз) 


Correction factor: C = ту i 815.10 

Total: (3.5)? + (4.2? +- -+ + (8.6)? — С = 1144.73 — 815.10 = 329.63 
Rows: $[(28.9)? +- - -+ (28.4) — С = 843.70 — 815.10 = 28.60 
Columns: 11(41.5)? +- +++ (840) — С = 893.97 — 815.10 = 78.87 


Treatments: $1(36.4)# -----4-(7.2)]] — С = 970.70 — 81510 = 155.60 


The divisor 6 is replaced by г for ап г Х 7 square. 
These sums of squares are entered in the analysis of variance. The 
error s.s. is found by subtraction. 


TABLE 4.8 ANALYSIS OF VARIANCE FoR A 6 X 6 LATIN SQUARE 


Source of variation d.f. 8.8. m.s. F 
Rows (order of sampling) (г — 1) = 5 28.60 5.720 1.72 
Columns (areas) (т — 1) = 5 78.87 15.774 4.74 ** 
Treatments (samplers) (r—1) -5 155.60 31.120 9.35 ** 
Error (т — 1)(т — 2) = 20 66.56 3.328 

Total (5-1) =35 329.63 


То obtain the mean squares, each sum of squares is divided by the 
corresponding number of degrees of freedom. The F value for samplers 
is 31.120/3.328, or 9.35, well beyond the 1% level, which is 4.10 for 
5 and 20 d.f. This shows that the extent of the overestimation varies 
from one sampler to another. The means for the samplers are given 
below. 


A B с D E F 
+6.07 +5.58 +6.12 +6.92 +2.67 +1.20 


The estimated standard error of each mean is V/s?/r, where 8° is the 
error m.s., or in this case V (3.328)/6 = 0.745. For testing the dif- 
ference between a pair of means, the standard error is 4/2 (0.745) = 
1.053. Since the 5% t value for 20 d.f. is 2.086, the difference between 
two means must be at least (2.086)(1.053), or 2.20, in order to attain 
significance at this level. It appears that the samplers fall into two sets— 
A, B, C, D and E, F, the biases being significantly smaller for Ё and F 
than for the others. 

Some further information about the nature of the biases can be ob- 
tained from an examination of the row and column totals. From table 
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4.8 the F value for rows is seen to be 1.72, which is below the 5% value, 
2.71. No clear effect of the order in which the areas were sampled has 
therefore been established ; further, the row totals (table 4.7) do not sug- 
gest any consistent trend. 

On the other hand, the Р value for the columns m.s., 4.74, is significant 
at the 196 level, so that the bias varies from area to area. Previous 
observations had indicated that the bias might depend on the mean shoot 
height of the area. The column totals are ranked below in order of ins 
creasing true mean shoot height. 


True mean Column 
shoot height total 
Area (centimeters) (centimeters) 
1 59.0 T4.5 
2 66.2 +34.8 
6 72.3 +34.0 
4 74.5 +18.2 
5 76.0 +25.1 
3 76.4 +17.7 


Apparently on areas where the shoots are not so high the positive bias 
is greater than on areas with relatively high shoots The result is not 
surprising, since the sampler had to stoop in order to do his work. The 
linear regression of the column totals on the true mean heights contrib- 
utes 59.88 to the sum of squares for columns, 78.87. The mean square 
for deviations from this regression (4 d.f.) is only 4.75, which is not sig- 
nificantly greater than the error m.s. The inference is that the large 
value of the mean square for columns is due mainly to the negative 
correlation between the bias and the mean height. For further discussion 
of this experiment, see (4.8). 

The analysis of two 4 X 4 squares is discussed in (4.9). A numerical 
example with five 3 X 3 squares will be found in (4.10). It should be 
noted that with several latin squares the interaction of treatments with 
squares may be tested. If there is no reason to expect that this inter- 


action is real, the corresponding sum of squares may be combined with 
error. 


4.35 Standard Errors. Theformula pe gives the estimated standard 


error of the difference between two treatment means, where s? is the 
mean square per unit and т the number of replicates, In cases where 
heterogeneity of errors is suspected, the error cannot be subdivided with 
the same ease as in randomized blocks. Itis possible to remove from the 
error 8.8. the contribution of any one of the treatments, Yates (4.11). For 
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each yield y; of the treatment, calculate the quantity 
d; = Ri + Ci — ryi 


where R; and С; are the corresponding row and column totals, and 7 is 
the number of treatments. The sum of squares of deviations of the 
quantities d; divided by r(r — 2), is the contribution of this treatment 
to the error s.s. and has (г — 1) degrees of freedom. For illustration, 
suppose that we wish to calculate the contribution of sampler F to the 
error in section 4.34. From table 4.7 the first d value is 


d, = (+28.9) + (+41.5) — 6(+3.5) = +49.4 


and the remaining values will be found to be +52.7, + 61.2, +57.5, 
+-29.5, +49.1, with a total +299.4. The sum of squares of deviations 
from the mean is 610.34. Divided by 24, this gives 25.43, with 5 d.f., 
for the contribution of sampler F to the error s.s. Consequently the 
error s.s. (table 4.8) may be divided as shown below. 


ағ 8.5. m.s. 
"Total error 20 66.56 3.328 
Contribution of F 5 25.43 5.086 
Remainder 15 41.13 2.742 


The error m.s. for the suspected treatment may be compared with 
the mean square for the remaining (r — 1)(r — 3) degrees of freedom. 
to test whether the treatment shows abnormally high variation. In 
this example the variance ratio is 1.85, with 5 and 15 df. If it is con- 
cluded that the treatment is unusually variable, the mean square with 
(r — 1) (r — 8) degrees of freedom is used as error for the other treat- 
ments, It should be stressed that the ordinary variance-ratio tables can 
be used in this test only if the decision to make the test was taken before 
examining the individual results. If the test is made merely because one 
treatment looks anomalous on inspection, different significance levels 
that take account of this fact are needed, as discussed in section 3.53. 


4.36 Missing Values. Except for changes in the formulae, the pro- 
cedure is similar to that for randomized blocks (section 4.25). А single 
missing value is substituted by means of the formula, reference (4.4) 


r(R +C + T) — 2G 
(= 06-2 


where R, C, T, and G are respectively the totals of the row, column, and 
treatment which contain the missing value and the grand total. The 


(4.5) 


y- 
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estimated standard error of the difference between the corresponding 
treatment mean and the meari of a treatment with no missing values is 


2 1 
iiu S E 


When several values are missing, repeated application is made of for- 
mula (4.5) as described in reference (4.4). When values have been in- 
serted for all missing data, the usual analysis of variance is calculated. 
One degree of freedom is subtracted from the error d.f. for each missing 
value. 

The exact formula for the standard error of a treatment mean is com- 
plex. Yates (4.4) proposed a useful approximate rule, which shows the 
number of replicates to be assigned to any treatment mean for a com- 
parison with another treatment mean. Each observation on one treat- 
ment is given one replication if the other treatment is present in both 
the corresponding row and column. The replication is 24 when the 
other treatment is missing in the row or column and is 14 when the other 
treatment is missing in both the row and column. When the treatment 
itself is missing, the replication is 0. Although complicated at first sight, 
the rule is not difficult to apply and is reasonably accurate. Consider for 
instance the following 6 X 6 square with 3 units missing (two B’s and 
one E) as indicated by the parentheses (). 


B E 6. F D A 
F D (B) E A g 
t A E D B F 
(E) с А (В) Е р 
А F D с Е В 
D B F A с Е 


Taking the yields of B and Е in the order of the rows, we ascribe the 
following numbers of replicates. 


B: $+0+14+0+14+1=32 = 44 

E: 14+44240+141=4 
Hence the standard error ot the difference between the treatment means 
is Vs" +4). For the difference between A and B, the reader may 
verify that the standard error is V #(%+H). Notice that the ef- 


fective replication for a treatment may change from one comparison to 
another. 


| Тһе analysis required when a single row, column, or treatment is miss- 
ing is described by Yates (4.11). When more than one row, column, or 
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treatment is missing, consult Yates and Hale (4.12). These methods 
are also presented by DeLury (4.13). 


4.37 Estimation of Efficiency. The effectiveness of either the row or 
the column grouping may be tested from the results of latin squares. 
The expression 
_ Е, + (n; + п.) He 
Ny + dne 

is an estimate of the error m.s. which would have been obtained if the 
row grouping had not been used, 1.е., if the design had been randomized 
blocks with the columns as blocks. In the formula, E, and E, are the 
mean squares for rows and error, respectively, and n;, 74, n, are the num- 
bers of degrees of freedom for rows, treatments, and error. The in- 
creased number of degrees of freedom with randomized blocks is taken 
into account by the method of section 2.31. The effectiveness of the 
column grouping is tested similarly. 

АБХ 5 experiment on potatoes will serve as an illustration. The 
mean squares for the yields of potatoes were as follows. 


E 


d.f. m.s. 
Rows 4 62.6 
Columns 4 104.7 
Treatments 4 272.0 
Error 12 26.2 


If the experiment had been conducted in randomized blocks with the 
columns as blocks, the estimated error m.s. would be 


4(62.6) + 16(26.2) _ 
T 


The degrees of freedom available for error would increase from 12 to 16. 
To take account of the advantage of the additional degrees of freedom 
(section 2.31), we reduce 33.5 by the multiplier (13)(19)/(17) (18), or 
0.969. This gives 32.5 as a comparable mean square for randomized 
blocks. The row grouping increased the information by an estimated 
25%. Of course a considerable number of these comparisons would be 
needed for drawing general conclusions. 


33.5 


4.4 Cross-over Designs 


4.41 Description. In dairy husbandry and biological assay a design 
has been used which closely resembles the latin square but may have 
some advantages when the number of treatments is small. In the sim- 


128 CROSS-OVER DESIGNS 4.41 


plest case where there are two treatments A and В, the units are first 
grouped into pairs as if a randomized block design were to be used. 
Suppose that from previous knowledge one unit in each pair is expected 
to give a higher response than the other and that the difference in favor 
of the superior unit is expected to be about the same in all pairs. It will 
clearly be advisable to ensure that each treatment is applied to the 
“better” member in half the replicates and to the “роогег” member in 
the other half. The pairs or replicates, which must be even in number, 
are divided at random into two equal sets, the first set to receive treat- 
ment A on the superior member of each pair, the second to receive B. 

The experiment discussed in chapter 1 provides an example. The 
data (shown in table 4.10) are the times required to compute the sums 
of squares of 27 observations on each of two machines whose speeds it was 
desired to compare. Ten sums of squares were calculated, making 10 
replications of the experiment. The cross-over design was used because 
it was thought that the second computation of a sum of squares might 
be faster than the first, so that for a fair comparison each machine should 
be used first in five of the ten replications. The randomization assigned 
machine A to be used first in replications 1, 3, 6, 8, and 9. 

The degrees of freedom in the analysis of variance are as follows. 


d.f. 

Columns (pairs or replicates) 9 
Rows 1 
Treatments 1 
Error 8 
Total 19 


Note that only 1 d.f. is assigned to rows. That is, the arrangement re- 
moves from error only the average difference between the 2 rows. In so 
far as the difference is not constant but varies from one replication to 
another, this variation enters into the experimental error. 

In some experiments it is known that this difference will vary, and it 
is possible to estimate in advance (at least roughly) whether the dif- 
ference will be large or small for a given replicate. Suppose that the dif- 
ference is expected to decrease steadily from replicate 1 to replicate 10. 
With this knowledge a more accurate design is obtained by the use of 
five 2 X 2 latin squares, as shown below. 


TABLE 4.9 Five 2 X 2 LATIN SQUARES FOR 2 TREATMENTS IN 10 REPLICATES 


Square 
Rows I п ш IV У 
1 2 3 4 5 6 1 5, 9 10 
Better А В ИВ В А А B A UH 
Poorer B А B A A B B A B А 
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In this arrangement we remove from error 5 d.f. for rows, one in each 
square. Thus not only the average difference is removed, but also the 
variation in the difference from square to square. The combined analysis 
of variance of the 5 squares is set out as follows. 


df. 

Squares 4 
Columns within squares 5 
Rows within squares 5 
Treatments 1 
Error 4 
Total 19 


The 9 4.1. for squares and columns within squares are exactly the same 
as the 9 4.1. for columns in the cross-over design. Owing to the more 
complete elimination of the row effects, there are only 4 d.f. for error 
instead of 8 with the cross-over plan. 

To summarize, the cross-over design is particularly appropriate when 
the difference between the rows is substantially the same in all replicates, 
for in this case the whole of the real difference between rows is con- 
centrated in the single degree of freedom and the error variance is no 
larger than that of the latin squares. Even if the difference between rows 
is known to be variable, the cross-over design may be preferable in small 
experiments where few degrees of freedom are available for error. 

In dairy husbandry the cross-over may be used to compare the effects 
of two feeding rations on the amount and quality of milk produced by 
the cow. Since cows vary greatly in their milk production, each ration 
is tested on every cow by feeding it during either the first or the second 
half of the period of lactation, so that each cow gives a separate replicate. 
The milk yield of a cow declines sharply from the first to the second half 
of its period, so that the first half is always “better,” in the sense above. 
Whether а cross-over is superior to a set of latin squares depends on 
circumstances. The rate of decline is not constant frem cow to cow; it is 
greater in general for high-yielding than for low-yielding cows. Thus if 
previous production records are available, the cows may be divided into 
pairs on the basis of yielding ability, each pair being made a separate 
2 x 2 latin square. This plan is likely to give a smaller error than the 
cross-over, though sometimes not sufficiently smaller to counterbalance 
the loss of degrees of freedom. 

The design can be used with any number of treatments, subject to the 
restriction that the number of replicates must be a multiple of the num- 
ber of treatments. With three treatments, for example, a plan can be 
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drawn up from the 3 cycles ABC, ВСА, CAB, where the order of the 
letters denotes the row to which each treatment is applied. Each cycle is 
allotted at random to one-third of the replicates. For higher numbers of 
treatments a design is constructed in the same way from the columns of 
any latin square. When the number of treatments exceeds four, how- 
ever, the degrees of freedom for error are sufficiently large so that a set 
of latin squares is usually preferable. 


4.42 Statistical Analysis. All sums of squares are calculated in the 
usual way. For the example (table 4.10), the computations are as fol- 
lows. 


Ы 
Correction factor: (952) = 6195.2 
Total: (30)? + (14)? + (21)? +---- (23)? — 6195.2 = 910.8 
Columns: 3[(44)? + (42)? +- - -+ (47)] — 6195.2 = 357.8 
‚ (194 — 158)? 
Rows: 20 = 648 
в — 2 
Treatments: (Iur pO = 320.0 


20 


TABLE 4.10 CROSS-OVER EXPERIMENT FOR COMPARING THE SPEEDS OF TWO 
COMPUTING MACHINES A AND В 


Time (seconds minus 2 minutes) taken to calculate a sum of squares 


Columns (replications) 
Rows Totals 
1 2 3 4 5 6 7 8 9 10 


First (poorer) | А30 | B21 | 422 | B13 | B13 | 429 | B 7 | A12 | A23 | B24 | 194 
Second (bet- 


ter) В14 | 421 | B 5| 422 | A18 | B17 | A16 | B14 | B 8| A23 | 158 


Totals 44| 42| 27| 35| 31| 46| 23| 26| 31 47 | 352 
Treatment totals: А 216; В 136 


Analysis of variance 
Source of variation d.f. 


8.8. m.s. 
Columns (replications) 9 357.8 39.8 
Rows (first versus second) T 64.8 64.8 
Treatments 1 320.0 320.0 ** 
Error 8 168.2 21.0 

Total 19 910.8 
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The error s.s. is found by subtraction. The F-ratio for treatments is 
significant at the 1% level. 

We will calculate the confidence limits for the true difference in speed 
which were quoted in section 1.12. The average observed difference is 
8.0 seconds in favor of machine B. The estimated standard error of this 
difference is У 2(21.0)/10, or 2.049, based on 8 d.f. Consequently the 
confidence limits for the true difference are 


8.0 = 2.0497 


where # is the value of Student's ¢ corresponding to (1 — P), where P is 
the confidence probability chosen. For P = .95, # = 2.306, and the 
limits are (8.0 + 4.7), or 3.3 and 12.7. For P = .80, # = 1.397, and, 
for P = .99, # = 3.355, giving confidence limits of (5.1, 10.9) and (1.1, 
14.9), respectively, as quoted. 

Worked examples for the case where there are two treatments are also 
given by Brandt (4.14) and Fieller (4.15); the latter also illustrates the 
analysis of covariance for this design and the procedure when one column 
is missing. 

In general, if there are # treatments and r replicates, the degrees of 
freedom subdivide as follows. 


df. 
Columns (елг4) 
Rows (4-1) 
‘Treatments (t — 1) 
Error t- 1( — 2) 
Total (tr — 1) 


2 
4.43 Standard Errors. The usual formula = holds for the standard 


error of the difference between two treatment means. 
The substitution formula for a missing value is 
— 2G 
rC + (R + T) (47) 
(t— 1)(r — 2) 
where the capital letters refer respectively to the totals of the column, 
row, and treatment in which the missing value appears and to the grand 


total. The difference between the mean of the affected treatment and 
the mean of a treatment with no missing value has a standard error 


2 i 
ЕР as 4.8 
5, É 2 r(t — 1)(r — zl (8) 


y= 
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4.5 Triple Grouping: Graeco-latin Squares 


4.51 Description. In this arrangement the treatments are grouped 
into replicates in three different ways with the consequence that the ef- 
fects of three different sources of variation are equalized for all treat- 


ments. 
TABLE 411 А 5 X 5 GRAECO-LATIN SQUARE 


Litters Pen (weight groups) 
а b c d e 
I |4 В; Cs D» Е; 


The additional grouping (usually represented by greek letters) is de- 
noted here by subscripts. Each treatment (A,---, E) appears once in 
each row and column and once with each subscript (1-.-5). 

One example is an arrangement proposed by Dunlop (4.16) for 
testing 5 feeding treatments on pigs. The arrangement requires 5 pigs 
from each of 5 litters (I, II, III, IV, and V), the effects of litter dif- 
ferences being equalized by making litters correspond to the rows of the 
square. The animals are housed in 5 pens (a, b, c, d, and e) which con- 
stitute the columns. The columns are used also to control differences in 
weight; i.e., the heaviest animal in each litter is assigned to the first pen, 
the next heaviest to the second pen, and so on; consequently the columns 
eliminate simultaneously the pen differences and the principal variations 
in weight within litters. Thus far the design is an ordinary latin 
square. 

The subscripts signify the positions of the five feeding crates within 
each pen. This extra control, though possibly unnecessary, was Sug- 
gested for several reasons. The first and fifth crates in each pen were of 
different construction from the other crates; moreover the pigs occupying 
the end crates have less “company” when feeding. 

The graeco-latin square has not been used often, probably because the 
units can seldom be balanced conveniently in all three groupings. In the 
example discussed above, the practical difficulty was to secure five litters 
of which each contained five or more pigs. An interesting industrial ap- 
plication is described by Tippett (4.17). 
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The designs have been constructed for all numbers of treatments from 
3 to 12, except 6 and 10; examples are shown in plan 4.2 (p. 146). 


4.52 Randomization. Arrange the rows and columns independently 
at random. Assign the latin letters and the subscripts at random to 
their respective classifications. 


4,58 Statistical Analysis. The sums of squares for rows, columns, 
treatments, and subscripts are all obtained in the usual way. For an 
т Xr square the error has (г — 1)(r — 3) degrees of freedom. This 
number is rather inadequate when r is less than 6. 


46a Designs for Estimating Residual Effects When 
Treatments Are Applied in Sequence 


461a Adjustment for Residual Effects. One of the most common 
uses of cross-over and latin square designs is in experiments in which 
the different treatments are applied in sequence to the same subject, 
animal, machine, or plot. The rows of the square represent the succes- 
sive periods of application, while the columns represent the subjects, 
animals, machines, or plots. The standard methods of analysis which 
we have described involve the implicit assumption that there is no 
residual or carry-over effect of any treatment into the succeeding period. 
Where there seems to be some risk of residual effects, a common practice 
is to separate any two periods of treatment by an interval of time long 
enough for the residual effects to have died out. 

To allow a long enough “rest period" is sometimes not feasible, how- 
ever, or is undesirable on other grounds. In agricultural field experi- 
ments we should not want the rest period to be longer than the interval 
between the harvesting of one year's crop and the sowing of the next 
crop. In some dairy cow feeding experiments the whole experiment 
must be completed in one lactation, so that the tota! time available for 
treatment periods plus rest periods is limited. And in general, the 
shorter the rest period, the sooner the experiment is finished. 

Another solution is to change the method of analysis of the results so 
that the treatment means can be adjusted for residual effects. To see 
what this implies, suppose that the experiment consists of the two 
3 X З latin squares shown in table 4.62. 

Besides producing direct effects ta, ір, and t, during the period in which 
they are applied, the treatments also produce residual effects ra, ть, and 
Te, respectively, in the period immediately following the one in which 
they are applied. Thus for the third period in sequence I, the predicted 
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TABLE 4.6& А DESIGN FOR THE ESTIMATION OF RESIDUAL EFFECTS 


Sequence 
Period І II ІП IV У VI 
$ A B с А В с 
2 В с А C A B 
3 с А В В с А 


total treatment effect is (te + ть), since treatment С is given in the third 
period and treatment B in the period immediately preceding. For the 
second period in this sequence the total treatment effect is likewise 
(ly + та). In the first period the treatment effect is simply ta, since it 
is presumed that all the experimental material was treated alike prior 
to the start of the experiment. If the quantities ża, £j, and te are esti- 
mated from this model, they provide measures of the direct effects that 
are free from the disturbance due to residual effects. 

The statistical analysis is made easier if any treatment is preceded 
equally often in the design by each of the other treatments. The design 
in table 4.6a has this property, every treatment being preceded twice by 
each of the other treatments. Designs with this property have been 
called balanced with respect to residual effects, although the balance is 
incomplete because a treatment is never preceded by itself. Williams 
(4.5a) has shown that if the number of treatments is even, this type of 
balance can be achieved by the suitable choice of a single latin square. 
When the number of treatments is odd, two latin squares are required 
for balance. Balanced designs for 4, 5, and 6 treatments are given in 
table 4.7а. 


TABLE 4.7a DESIGNS BALANCED FOR RESIDUAL EFFECTS 


Four treatments Six treatments 
Period Period 

1 AVRO н ED 1 ae Vy АЕ Р 

2 Boris Cys DIO] A 2 CHa Dae @ F A B 

3 D A B с Б] В с D E F A 

4 С ued. кА ЕВ 4 Жаса К\з аА В C ‚р 
5 Kad, AB с р Е 
6 D E F A B с 

Five treatments (use both 
Period : es ina 


ы со мо н 
Ot» 
ORO 
өысыьоо 
оные 
BOQ 
SewQap 
abad 
Борыс 
BARS 
abdi 
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Since the 4 X 4 square provides only 3 d.f. for the experimental error, 
at least two squares are recommended with this design. With 3 or 5 
treatments, the number of squares should be a multiple of 2. Treat- 
ments are assigned to letters at random, and the columns (sequences) 
arerandomized. If the experiment has more than one square, the experi- 
menter must decide whether to remove the period effects separately in 
each square, as is best if period effects are likely to differ from square to 
square, or to remove only the overall period effects, as in a cross-over 
design. In the former case the squares are kept separate and columns 
are randomized within each square: in the latter, all columns are ran- 
domized. 


4.62a Statistical Analysis When Residual Effects Are Present. The 
analysis follows methods developed by Williams (4.5a). The numerical 


TABLE 4.8a PLAN AND MILK YIELDS PER PERIOD (IN 10 тв. - 100) 


Sequence (cow) 
Period I II ш Totals IV v УТ Totals 


1 A 38 B 109 С 124 271 А 86 В 75 С 101 262 
2 B25 С 86 А 12 183 C 76 A 35 B 63 174 
3 C 15 A 39 В 27 81 В 46 С 34 Mort 81 
Totals 78 234 223 535 208 144 165 517 
Computation of direct and residual effects 
т Direct effect 
T R F Тао means В = 247 f в 
А 20 162 399 —383 42.5 —193 -8.0 +46 
В 345 137 431 —56 56.1 —100 -4.2 —83 
C 486 220 222 +439 76.7 +293 +12.2 +37 
1052 519 1052 0 0 0.0 0 
Analysis of variance 
Source of variation d.f. 8.8. m.s. 
Sequences (cows) mn —1 5 5,781.1 
Periods within squares m(n — 1) 4 11,489.1 
Sat effects (unadj.) n—1 2 2,276.8 
Residual effects (adj.) n—i 2 616.2 308.1 
ji wem effects (unadj.) n—1 2 38.4 
Direct effects (adj.) n—1 2 2,854.6 1,427.8 
Error (n — 1)(mn — m — 2) 4 199.2 49.8 


"Total тт? —1 17 20,362.4 
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example in table 4.8a comes from an experiment on the feeding of dairy 
cows (4.602), the treatments being as follows: 


А = Roughage; В = Limited grain; C = Full grain 


The data are the milk yields per period (6 weeks). The original experi- 
ment consisted of six 3 X 3 latin squares, of which two have been chosen 


to illustrate the computations. 
Compute the marginal totals shown in table 4.8а. Additional sym- 


bols and formulae needed are as follows: 
п = number of treatments = 3 
т = number of squares = 2 
For each treatment, compute 
T — treatment total 


В = total of the yields in periods immediately following the 
application of this treatment. For treatment А, 


R = 25 + 27 + 76 + 34 = 162 


F = total of the sequences (columns) in which this treatment is 
the final one. For treatment A, 


F = 234 + 165 = 399 


If P, = 533 is the total of all yields in the first period and G = 1052 is 
the grand total, we зо require the two quantities 


P,—nG- P,—3G- 533 — 3(1052) = — 2623 
AP, — (n + 2)@ = 3P, — 56 = 3(533) — 5(1052) = —3661 


For each treatment, the values of T, R, and F and the subsequent 
computations are set out under the plan in table 4.8a. For the esti- 
mated direct effect of a treatment, the general formula is 


mn(n? — n — 2) = (n? — n — DT +nR +F + (P, — nG) 
Ai = 5T +3R 4 poe 2623 
The values of 24/ should add exactly to zero оуег all treatments. The 
figures are then divided by 24, and the general mean 1052/18 = 58.44 
is added to each quotient to give the direct effect means, which can be 


regarded as the treatment means per period, adjusted so as to remove 
residual effects from the previous period, 
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The estimated residual effect # of a treatment is given by the equation 
тп(п? — n — 2)? = nT + п?В + nF + пр, — (n + 2)G 
24f = ЗТ + GR + ЗЕ — 3661 
These figures also sum to zero over the treatments. The f values are 
obtained on division by 24. The general mean can be added to each of 
the f values if desired. 

It may help to clarify the process if we show how the direct effect 
means can be obtained from the unadjusted treatment means. For 
treatment А, the unadjusted mean is 271/6, or 45.2. In terms of the 
model, this quantity is 

6 (direct effect mean) + 2% + 27, 
6 
since treatment A is twice preceded by B and twice by C. Hence 
Direct effect mean = 45.2 — $(f, + fj) = 45.2 — 1(—4.2 + 12.2) 
= 42.5 


45.2 = 


as already found. 

Tn the analysis of variance (table 4.8a), the total sum of squares and 
the sums of squares for sequences (cows) and periods are found in the 
usual way. 


Correction factor: С = — = ——— = 61,483.6 
mn’ 18 


Total: (38)? + (109)? +- --+ (1)? — С = 20,3624 
Sequences: 3[(78)? + (234)? +++++ (165)?] — С = 5,781.1 
Periods: $[(271)? + (183)? +++++ (81)?] — ${(535)? + (517)?] 
= 11,489.1 


The sum of squares for Periods is computed above as “ВеіжеепРегіойв 
within squares,” since the experiment was planned as a set of latin 
Squares and it was intended to remove period effects separately in each 
square. 

Owing to the fact that the direct and residual effects are to some de- 
gree entangled, their sums of squares will not add up correctly unless 
they are calculated appropriately. The total sum of squares for treat- 
ment effects may be computed either as 

Direct effects (unadjusted) + Residual effects (adjusted) 
or as 
Direct effects (adjusted) + Residual effects (unadjusted) 


138 DESIGNS FOR ESTIMATING RESIDUAL EFFECTS 4.628 


Tt is best to obtain all four sums of squares, since this supplies a check. 
The quantities to be squared will be found in the Computation" sec- 
tion under the plan in table 4.8a. 


Direct (unadjusted) : 
ut -С- (ату + (345)? + (436)?] — С = 2,276.8 
mn 


Residual (adjusted) : 
re 
mèm? — п — 2) 
Direct (adjusted) : 
19% 


mn(n? — n — 1)(n? — n — 2) 


1 5 Д 
= — 100)? + (293)2] = 616.2 
216 [(193)? + (100)* + (293)?] б 


m uM 2 2 7 2 
“76 [(383)? + (56)? + (439)?] 


= 2,854.6 
Residual (unadjusted) : 
For this we need the auxiliary quantities 
R=-R+G-nT=R+G-37 


shown on the extreme right of the “Computation” section. The sum of 


squares is 
> В? 1 2 2 2 
кк тонери дро [(46)? + (83)? + (37)2] = 38.4 


As a check, note that 

2,276.8 + 616.2 = 2,854.6 + 38.4 
The Error sum of squares is found by subtraction. If only the over- 
all, sum of squares for Periods is removed, the d.f. for Error become 
(n — 1)(mn — 3). 

In making F-tests of the direct and residual effects, the adjusted mean 
squares must be used. In this example the direct effects give a sig- 
nificant F-value, but residual effects do not attain significance, although 
they did so in the larger experiment from which the data are taken. 


4.63а Standard Errors. The standard error of the difference between 
two direct effect means is 


bs? Y (n? — n- 1) " 2(49.8) i (5) Т” 
т (2-тп-9) 6 (4) 5 
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where r = mn is the number of replicates of each treatment. For the 
difference between two residual effect means, we have 


T п? em (9) 

= = ` = 6.11 
т (?—n—92) 6 (4) 
4.64а Some Further Developments. Like the latin squares, these de- 
signs are restricted as to the number of replications and the number of 
periods. Patterson (4.10a) has investigated the incomplete block de- 
signs that can be balanced with respect to residual effects: these designs 
will be useful when the number of treatments exceeds the number of 
periods. 

In the statistical analysis given above, adjustment is made only for 
residual effects from the period immediately preceding. It is assumed 
that no residual effects persist from treatments applied two periods 
back. Designs can be constructed that are balanced both for treatments 
immediately preceding and for treatments applied two periods back 
(Williams, 4.5a, 4.82). For n treatments, (n — 1) squares must be used. 
А completely orthogonal set of latin squares supplies the necessary de- 
sign. This is а set of (n — 1) squares such that when any two squares 
are superimposed, each letter of one square occurs once with every letter 
of the other square. А doubly balanced design also exists for n = 6, for 
which there is no completely orthogonal set of squares. 

When the period means exhibit a marked trend, as in the case of dairy 
cow milk yields within a lactation, Patterson (4.72) has shown that a 
more complex mathematical model may be required in order to obtain 
unbiased estimates of error from the analysis. If the trend in the period 
means is analyzed into its linear, quadratie, and higher-degree com- 
ponents, the variance from cow to cow of the linear component may be 
substantially larger than that of the other components. When this hap- 
pens, the standard error obtained for direct effect means from the analy- 
sis given above is subject to a bias, which can be removed by applying 
the more realistic model. From an examination of data from dairy cow 
experiments, Lucas (4.92) concluded that the bias did not appear to be 
of practical importance in 3 X 3 experiments, but might be so in 4 X 4 
experiments. 


465a Designs for More Thorough Investigation of Residual Effects. 
From the standard error formulae in section 4.632, it is seen that the 
standard error of a difference is greater for residual than for direct effects 
in the ratio n/V (n? — n — 1), so that these designs give less precise 
estimates of residual than of direct effects. This is not undesirable if 
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the experimenter uses the residual effects solely in order to obtain un- 
biased estimates of the direct effects. In some lines of work, however, 
estimates of the sizes of the residual effects are of as much interest as 
estimates of the sizes of the direct effects. For this purpose, another 
drawback of the designs given above is that there is a positive correla- 
tion between the estimated residual and direct effects of a treatment. 
This implies that the tests of residual and direct effects are not quite 
independent. 

Designs which give independent estimates of direct and residual ef- 
fects, of approximately equal precision, are obtained by adding an extra 
period to the designs above (4.13a). In the new final period, the treat- 
ments that were applied in the previous final period are repeated. Table 
4.9a illustrates the design for three treatments, obtained by extending 
the two 3 X 3 latin squares. 


TABLE 4.0а AN ALTERNATIVE DESIGN FOR ESTIMATING RESIDUAL AND DIRECT 


EFFECTS 
Sequence 
Period I IL Ш аул I 
т А В с А В с 
2 ВС Chal Aas В 
3 C o. 1,8. Bor „4 
4 Се А n NU 4 


Any treatment is now preceded equally often by every treatment, in- 
cluding itself. This makes the estimates of direct and residual effects 
orthogonal. In the notation of section 4.62a, table 4.10a gives the 


TABLE 4.108 FORMULAE FOR DIRECT AND RESIDUAL EFFECTS 


Direct effects Residual effects 
Баа po @tDT-F-¢ pete Pi 6 
mn(n + 2) mn? 
s.s. in analysis of variance CS EET iim Gr Za © 
mn(n + 1)(n + 2) mn? 


s.e. of diff. between 2 means | 4 [28m T +1 ре 
mn(n + 2) тп 


formulae needed in the analysis. Although direct effects are more рге- 
cisely estimated than residual effects, the difference in precision is much 
reduced as compared with previous designs. Note that since direct and 
residual effects are orthogonal, their sums of squares appear only once 
in the analysis of variance. 
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Numerous variations in the design are possible. If the magnitude 
and duration of the residual effects are of primary interest, we might 
use a design in which after the first period each treatment is followed 
for several periods by a standard treatment S or by no treatment. With 
three treatments, the sequences would гип ASSS, BSSS, CSSS. Some- 
times interest lies in the cumulative effect of repeated application of a 
treatment: for this, sequences AAA, BBB, CCC are used. The experi- 
ment may contain some cumulative sequences and some balanced se- 
quences, giving estimates of direct, residual, and cumulative effects. 
These designs have been used to some extent in experiments on perennial 
crops and on crop rotations in agriculture (4.11а, 4.12a). 

A special problem arises in dairy cow feeding experiments. In prac- 
tice, the farmer would probably give a successful feed to the cows during 
the whole of their lactations. Consequently, the type of experiment 
that simulates practical conditions is one in which each feed is given 
during the whole of the lactation. Such experiments tend to have high 
standard errors, however, because the variation between cows enters 
into the error. Consequently the latin square design is sometimes used 
as a substitute that may give more precise results because variation 
among cows does not contribute to the error. For this purpose, how- 
ever, the quantity that we should try to estimate from the latin square 
is not the direct or residual effect, but the cumulative effect that would 
result if a treatment were applied over all periods. Under certain as- 
sumptions the average cumulative effect per period is approximated by 
the sum of the direct and the residual effects. 


4662 Switchback or Reversal Designs. During the period of lacta- 
tion the milk yield of a cow rises at first but thereafter exhibits a decline. 
The rate of decline varies markedly from cow to cow. With latin square 
designs, these variations in rate of decline contribute to the experimental 
error. For instance, if treatment C is given in the final period to a 
group of cows that have small declines, while A is given last to cows 
that have steep declines, the experiment favors C over A. 

The switchback design copes with this source of error. To compare 
two treatments, three periods are used. Half of the animals receive the 
sequence АВ» Аз, the other half the sequence Ву АэзВз, where the suffixes 
denote the periods. For an animal in the first group, the difference per 
period between A and B is estimated as 


Аз + Аз _ 
2 
The important point about this estimate is that it is orthogonal to the 


Ba 
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linear component of the period effects. If the decline is linear, the rate 
of decline, whether small or large, does not contribute to the error of 
the comparison of A with B. For an animal in the second group, the 
treatment difference is estimated as {А — (В, + Вз)/2}, and the same 
property holds. 

With four periods, the double reversal design uses the two sequences 
А:В2АзВа and B,AgB3A,4. For a cow in the first group, the estimate 


Ay + ЗАз — ЗВ. каб В, 
4 


is orthogonal to both the linear and the quadratic components of the 
period trend. The treatment comparison is unaffected by that part of 
the trend which can be represented by a second-degree curve in time. 

The analysis of these designs has been described, with examples, by 
Brandt (4.14) and later by Ciminera and Wolfe (4.14a). Extensions of 
the designs to more than two treatments are given by Taylor and Arm- 
strong (4.15a) and Lucas (4.16a). Since the designs have a specialized 
objective, it is advisable to compare their precision with that of the 
latin square before adopting them as the best tool. 
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Selected latin squares 


Plan 4.1 


4x4 


3x3 


AoA 
CASA 
ADO 
җщод 


аса 
оза 
мачо 
ласа 


алас 
DANA 
MOAN 
SAMOA 


Rote 
CART 
MIAD 
IADA 


ox 
аҹ 
час 


7X7 


6x6 


5x5 


әчаодық 
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RQROWNQOA 
AARONO 
олым 
fao WR OX 
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RAIADO 
AIORA 
aora 
ойына 
мячом 
ADARNA 
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QoRxm 
omxmaa 
чаво 
чауоаң 


ызасаымыош 
цьаоанао 
оцычасаны 


`мощычмоанм 


o 
x 
o 


8x8 


ARONIA 
AARONO 
раненым 
MOARROMAY 
C OQ RE CH N 


шчаодымо 
әшчаосдашы 
ымәшчаосам 
QROMTQOA 
ARORA 
содықошіа 
содымошеч 
заодаықош 


11 X11 


10 X 10 


ачасадықош»ы» 
ымчасанысшы 
ыымчасамкош 
шцаымчаосанмыо 
Dunkin on 
оныьмчаоаы 
ARORAKN AMOR 
ARRON KRS TROD 
O QR R O5 RSV S d LRQ 
fO QR, t5 Rios S hd d 
MA DARE. 


RINOR шы 
һъчауоаыафо 
RAK часа 
BRAK TROARR 
BROWN Ччаосан 
қызшы,чаоса 
ARRORRKR Чао 
OQ f R t5 RSS ча 
MOARRSAAKT 
чауоаҷаощы 
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Plan 4.1 (Continued) 


SSN QS Wa 


GRAECO-LATIN SQUARE DESIGNS 


Selected latin squares 
12 X 12 
Recep k ТГУ Кк? 
C ЕЕ G.H 1. J ЕТА 
DRR F G HI'JEKLAS 
ЕСА УОК b AaB C 
ЛЕН LJ uke b 4,8,0, 0 
НИТ & ЖАБА ВСЕ 
Hw E АВ DR Ж 
ЧК Е АВ СР ЕЕ 
4 ГМА B Q.D E F Ө Н 
EK Loa Bc D^F F G H 
BABQDEBPGHI F 
ABPcCDAPGAEIS K 
Graeco-latin squares 
4x4 5x5 
Ат Вз С, D: Аі Bs Cs 
В, Ag Ds Cy В C4 Di 
C; Di Аз В, Сз Ds Ез 
D, C» В, Аз Dy Е! Аз 
Es Аз B4 
7 8х8 
Ез Р: б Ai Bs C$ Dg Е 
Fy G As В, Ag бі Е; Hs; 
Gs Аз В Cs б, Ат № D: 
Ав Вз Ст De Fs Ев As Cs 
В Cy Di Еу Hı Ds Cy А 
Cı Ds E» Fe Dz На Вз Gs 
D» Es Fs бі Cs Bs Н» Fy 
Hs Es Fs Ge В; 


PLANS 147 


Plan 4.2 (Continued) Graeco-latin squares 


CHAPTER ^5 


Factorial Experiments 


5.1 Description 


5.11 A 2? Factorial Experiment. In a factorial experiment the effects 
of a number of different factors are investigated simultaneously. The 
treatments consist of all combinations that can be formed from the dif- 
ferent factors. To illustrate the simplest case, consider an experiment 
on sugar beet with 2 factors. These were nitrogen, none (по) versus 3 
cwt. sulphate of ammonia per acre (nı), and depth of winter ploughing 
(7 їп. versus 11 in). Ploughing took place in late January, the nitrogen 
was applied in late April, and the seed was sown early in May. Since 
both factors occur at 2 levels or variations, the experiment is described 
as a 2 X 2 factorial experiment. The 4 treatment combinations are 
shown below, with the mean yields of sugar per acre (ewt.) underneath. 


Treatment combinations and yields of sugar (cwt. per acre) 


1 2 3 4 
(no, 7 in.) (ny, 7 in.) (no, 11 in.) (ny, 11 in.) 
40.9 47.8 42.4 50.2 


The yields may be placed in the following 2 X 2 table. 


Nitrogen Response 
Depth no nm Mean ton 
Тіп. 40.9 47.8 44.4 +6.9 
11 in. 42.4 50.2 46.3 +7.8 
Mean 41.6 49.0 45.3 


И in. minus 7 in. | 1.5 42.4 


The results might be summarized as follows. Considering the effect 
of nitrogen, we might report that the application of nitrogen increased 
yields by 6.9 cwt. with shallow ploughing and by 7.8 cwt. with deep 
ploughing. These figures are called the simple effects of nitrogen. They 
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represent the type of information that would be wanted, for instance, 
in giving advice to a farmer who always used shallow ploughing but was 
doubtful whether to apply nitrogen. For the simple effects of depth of 
ploughing, we might report that 11 in. ploughing was superior to 7 in. 
by 1.5 ewt. in the absence of nitrogen and by 2.4 ewt. when nitrogen was 
applied. 

There is another way of looking at the results. It sometimes happens 
that the effects of the factors are independent. By this we mean that the 
response to nitrogen is the same whether ploughing is shallow or deep, 
and that the difference between the effects of deep and shallow ploughing 
is the same whether nitrogen is present or not. In this event the two sim- 
ple effects of nitrogen, 6.9 cwt. and 7.8 ewt., are estimates of the same 
quantity and differ only by experimental errors. On this supposition we 
would naturally average the two figures in order to estimate the response 
to nitrogen. The average, 7.4 ewt., is called the main effect of nitrogen. 
It can be derived alternatively as the difference between the two column 
means in the table, 41.6 and 49.0. Similarly the main effect of depth of 
ploughing (11 in. minus 7 in.) is the average of 1.5 cwt. and 2.4 cwt., or 
1.9 cwt. Note that a main effect, being an average of the simple effects, 
is more precisely estimated than the latter. In this experiment the 
standard error of a main effect is 1/4/2 times that of a simple effect. 

Consequently, if we are sure that the factors operate independently, 
the summary that was given above in terms of simple effects may be 
replaced by another that is both more concise and more accurate. This 
might read as follows. “Тһе application of nitrogen increased the yield 
of sugar by 7.4 cwt., while 11 in. ploughing increased the yield by 1.9 
cwt. as compared with 7 in. ploughing.” It is worth repeating that when 
the factors are independent the figure 7.4 ewt. is the best estimate not 
only of the average response to nitrogen, but also of the response on plots 
ploughed to 7 in. and of that on plots ploughed to 11 in. In other words, 
the whole of the information in the experiment is contained in the main 
effects, 

The question arises: How do we know whether the factors are inde- 
pendent? Frequently the answer is suggested by knowledge of the 
processes by which the factors produce their effects. In the present case 
ап agronomist might reason that deep ploughing should enable the plant 
to develop a more vigorous root system. With this the plant should be 
able to utilize more effectively any added nutrient such as nitrogen. 
Thus he might predict that the response to nitrogen would be greater 
with deep than with shallow ploughing, though he probably would not 
expect it to be much greater. In short, he would predict that the two 
factors would not be quite independent in their effects. 
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In addition to the information that may be available from such 
reasoning, a factorial experiment itself provides a test of the assumption 
of independence. For, if the depth of ploughing does affect the response 
to nitrogen, the difference between 7.8 ewt. (the response to nitrogen 
with deep ploughing) and 6.9 ewt. (the response with shallow plough- 
ing) is an estimate of this effect. The difference, 0.9 ewt., can be tested 
in the usual way by a t-test. If it proves significant, the assumption of 
independence is rejected by the data. The difference (sometimes divided 
by a numerical factor) is called the interaction between nitrogen and 
depth of ploughing. 

Interchanging the roles of the two factors, we may also consider 
whether the superiority of deep over shallow ploughing is affected by the 
presence of nitrogen. To measure the interaction in this case, we sub- 
tract 1.5 cwt. (superiority of deep ploughing when no nitrogen is added) 
from 2.4 cwt. (superiority when nitrogen is added). The difference is 
again 0.9 ewt. It is easy to see that this equality always holds with a 
2 X 2 experiment. Each difference is equal to the sum of the observa- 
tions in one diagonal of the 2 X 2 table, minus the corresponding sum 
for the other diagonal. Such interactions are called two-factor, or first- 
order, interactions. 


5.12 Advantages of Factorial Experimentation When Factors Are 
Independent. The advantages of factorial experimentation naturally 
depend on the purpose of the experiment. We suppose for the present 
that the purpose is to investigate the effects of each factor over some pre- 
assigned range that is covered by the levels of that factor which are used 
in the experiment. In other words the object is to obtain a broad picture 
of the effects of the factors rather than to find, say, the combination of 
the levels of the factors that give a maximum response. One procedure 
is to conduct separate experiments each of which deals only with a 
single factor. Another is to include all factors simultaneously by means 
of a factorial experiment. 

И all factors are independent in their effects, the factorial approach 
will result in a considerable saving of the time and material devoted to 
the experiments. The saving follows from two facts. First, as we have 
seen, when factors are independent all the simple effects of a factor are 
equal to its main effect, so that main effects are the only quantities needed 
to describe fully the consequences of variations in the factor. Secondly, 
in a factorial experiment, each main effect is estimated with the same 
precision as if the whole experiment had been devoted to that factor 
alone. Thus, in the preceding example, half the plots receive nitrogen 
and half do not. Consequently, the main effect of nitrogen is estimated 
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just as precisely as it would be in a simple experiment of the same size 
devoted to nitrogen alone. The same result holds for the effect of depth 
of ploughing. The two single-factor experiments would require twice the 
total number of plots in order to equal the precision obtained by the 
factorial experiment. If there were n factors, all at two levels and all 
independent, the single-factor approach would necessitate n times as 
much experimental material as a factorial arrangement of equal precision. 
The gain from factorial arrangements in this case is very substantial. 

Practical considerations may diminish this gain. The experimenter 
frequently lacks the resources to conduct a large experiment with many 
treatments, and must proceed with only one or two factors at a time. 
Further, it has been pointed out previously that, as the number of treat- 
ment combinations in an experiment is enlarged, the standard error per 
unit increases. This standard error is therefore likely to be higher for a 
large factorial experiment than for a comparable single-factor experi- 
ment. This increase in standard errors can usually be kept small by the 
device known as confounding, described in chapter 6. 


5.13 Factorial Experimentation When Factors Are Not Independent. 
We assume that the purpose is still to investigate each factor over the 
range represented by its levels. When factors are not independent, the 
simple effects of a factor vary according to the particular combination 
of the other factors with which these are produced. In this case the 
single-factor approach is likely to provide only a number of discon- 
nected pieces of information that cannot easily be put together. In 
order to conduct an experiment on a single factor A, some decision must 
be made about the levels of other factors В, С, D, say, that are to be 
used in the experiment (e.g., whether all plots should be ploughed 7 in., 
9 in., or 11 in. deep in an experiment on nitrogen). The experiment re- 
veals the effects of A for this particular combination of B, C, and D, but 
no information is provided for predicting the effects of A with any other 
combination of B, C, and D. With a factorial approach, on the other 
hand, the effects of A are examined for every combination of B, C, and 
D that is included in the experiment. Thus a great deal of information is 
accumulated both about the effects of the factors and about their inter- 
relationships. 

In this connection, Fisher (5.1) has pointed out that it is sometimes 
advisable to introduce into an experiment an extra factor that is not it- 
self of interest, in order that the experiment may form the basis for 
sounder recommendations about the other factors. In agricultural ех- 
perimentation in Britain, farmyard manure has served as a subsidiary 
factor of this kind. Any recommendations made to farmers about other 
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factors will be put to the test in some fields where the farmer has ap- 
plied manure and in others where he has not, so that it is well to in- 
vestigate the other factors in both the presence and the absence of ma- 
nure. 


5.14 Summary Comments. To summarize, the following are some in- 
stances where factorial experimentation may be suitable: 

1. In exploratory work where the object is to determine quickly the 
effects of each of a number of factors over a specified range. 

2. In investigations of the interactions among the effects of several 
factors. From their nature, interactions cannot be studied without test- 
ing some of the combinations formed from the different factors. Fre- 
quently, information is best obtained by testing all combinations. 

3. In experiments designed to lead to recommendations that must 
apply over a wide range of conditions. Subsidiary factors may be 
brought into an experiment so as to test the principal factors under a 
variety of conditions similar to those that will be encountered in the 
population to which recommendations are to apply. 

On the other hand, if considerable information has accumulated, or 
if the object of the investigation is specialized, it may be more profitable 
to conduct intensive work on a single factor or on a few combinations of 
factors. For instance, some investigations are directed towards finding 
the combination of the levels of the factors that will produce a maximum 
response. Friedman and Savage (5.2) present arguments to show that 
a well-planned series of single-factor experiments will often reach the 
maximum more quickly than a single large factorial experiment. An 
alternative strategy for this problem, due to Box and Wilson (5.1а), 
does use experiments in which all the factors appear, but for studying 
the nature of the response in the neighborhood of the maximum these 
authors developed special designs that differ from the standard factorials 
discussed here. Chapter 8A contains a fuller account of this problem. 

When the factors to be investigated are numerous, the chief disad- 
vantage of a single factorial experiment lies in its size and complexity. 
The magnitude of the task can be reduced by having only a single repli- 
cation (section 6.24). It is even possible (chapter 6A) to obtain most 
of the desired information by testing only a fraction (e.g. one-half or 
one-quarter) of the total number of treatment combinations, although 
this course is not without its risks, The difficulties arising from large 
experiments should not be considered a criticism of the factorial method, 
whose efficiency is greatest when the factors are numerous. The basis 
of the difficulty is that a major research program is being undertaken. 

Experimenters sometimes find the results of factorial experiments 
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difficult to interpret, because they appear to present a bewildering va- 
riety of treatment comparisons. It is true that the competent summary 
of a large factorial experiment demands an orderly procedure and often 
takes considerable time. If the factors are for the most part independent, 
the method of analysis by means of main effects and interactions (to be 
illustrated later) will reduce the data to manageable proportions. If 
the numerous factors interact in a puzzling manner, prolonged study of 
the results and further experimentation may be needed before the facts 
are mastered. The trouble in this case is that the phenomena are com- 
plex, not that the experimentation is faulty. 


5.2 Calculation of Main Effects and Interactions 


5.21 Notation for the 2” Series. The object of section 5.2 is to ex- 
plain how main effects and interactions are calculated, and how they are 
represented in the analysis of variance of the results. We begin with the 
2” series, where each factor occurs at only two levels. 

Yor this system the notation used is similar to that of Yates (5.3). 
Letters A, B, С, >>> denote the factors. The letters a, b, c, +++ denote 
one of the two levels at which the corresponding factor occurs; for pur- 
poses of clarity this level will be called the second level. The first level 
is signified by absence of the corresponding letter. Thus the treatment 
combination bd, in a 2* factorial experiment, means the treatment com- 
bination which contains the first levels of factors A and С, and the second 
levels of factors B and D. The treatment combination which consists of 
the first level of all factors is denoted by the symbol (1). 

The symbol (ab) will denote the mean of all observations which receive 
the treatment combination ab. The letters A, B, and AB, when they 
refer to numbers, will represent, respectively, the main effects of A and 
B and the A by B interaction. 


5.22 The 2? Factorial Experiment. As already shown in section 5.11, 
we have 


A = Ша) — @ + (à) — (01 
B = (а) + (b) — (&) — (01 


Yates (5.3) introduces the same multiplier 4 for the interaction, which 
he defines as 
AB = Hat) — (0) — (а) + (D) 


These quantities and the general mean M are shown in terms of the 
means for the treatment combinations in table 5.1. 
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TABLE 5.1 MAIN EFFECTS AND INTERACTIONS EXPRESSED IN TERMS OF 
INDIVIDUAL TREATMENT MEANS: 27 FACTORIAL 


Treatment combination 


Factorial Divisor 
effect 
(1) (а) @) (ab) 
М + +, + м 4 
А - + - + 2 
B ES = T + 2 
АВ T = " + 2 


The rows of the table express the factorial effects in terms of the 
original means. If the equations represented by the table are solved for 
(ab), (b), etc., in terms of M, A, ete., it will be found that the columns 
of the table enable us to express the original means in terms of the 
factorial effects. For example, from the column for (ab), 


(ab) = М+ MA +В + AB] 


The only point to remember is that the factor $ occurs with all terms 
except M. From these results, simple effects may be caleulated from 
factorial effects. "Thus 


(а) — (1) = simple effect of A when B is at the first level = A — AB 
(ab) — (b) = simple effect of A when B is at the second level = A + AB 


These relations are useful when an experiment has been summarized in 
terms of the factorial effects, and it is later desired to estimate some of 
the simple effects. From the above example it may be noted that the 
quantity AB measures the error that is committed in estimating the 
simple effects of A if the two factors are erroneously assumed to be in- 
dependent. 

The functions A, B, and AB satisfy the conditions for an orthogonal 
set of functions (section 3.42). Consequently the squares of the fac- 
torial effects, when suitably multiplied, divide the treatments s.s. inte 
three single components, each with 1 d.f. In practice these components 
will usually be calculated from the treatment totals [ab], [b], ete., rather 
than from means, If we define factorial effect totals as illustrated below, 


[А] = [ab] — [5] + [a] — Ш 
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then the sum of squares for A in the analysis of variance is [A]?/4r, where 
r is the number of replicates. Corresponding formulae hold for the other 
factorial effects. 


5.23 The 23 Factorial Experiment. In this case there are eight treat- 
ment combinations: (1), а, b, с, ab, ac, be, and abe. The simple effect of 
A is determined for each of four combinations of the other factors: (1), 
b, c, and be. As before, the main effect of A is defined to be the average 
of these four simple effects. 


A = (ав) — (bc) + (ab) - (b) + (ac) — (0 + (а) - (1)] 


Similar definitions hold for B and C. 
The interaction of A with B is now measured separately at each of the 
two levels of C. 


AB (С at second level) = $[(abe) - (bc) — (ac) + (о) 
AB (C at first level) = (а) — (b) — (а) + (1)] 


As would be expected, the quantity AB is taken as the average of these 
two effects. Thus 


AB = (ао) — (bc) — (ac) + (c) + (ab) — @) — (а) + (1)] 


There are two other first-order interactions, AC and BC, which are de- 
fined similarly. 

In addition, we encounter a new interaction. Separate estimates were 
given above for AB at each of the two levels of C. The difference be- 
tween these two estimates measures the effect of C on the AB inter- 
action. This difference, with the conventional factor 4, ів 


аһ) — (be) — (ac) + 9 — (ab) + @ + (а) - (01 


and may be called the interaction of AB with С. If the algebra is carried 
out, it will be found that the same expression measures the interaction 
of AC with B, and that of BC with A. Hence the quantity is called the 
ABC interaction. It is a three-factor, or second-order, interaction. 
Three-factor interactions are more difficult to understand than two- 
factor interactions. Fortunately, in practice three-factor interactions 
are often small relative to main effects and two-factor interactions; 
and quite frequently they can be neglected for the purposes to which the 
results are to be put. Occasionally, cases arise where they are important. 
It might happen, for instance, that factor A does not exert any influence 
unless factors B and C are present in the combination (bc). In this event 
the interaction ABC is as large as the main effect of A or the interactions 
АВ and АС. The same effect may occur in less extreme cases, where the 
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combination bc is particularly favorable to the response to factor A. If 
three-factor interactions are found to be substantial, a careful scrutiny 
of the simple effects is usually helpful in interpretation. 

The expressions for the factorial effects in terms of the treatment 
Means are summarized in table 5.2. The rows of the table give the 


TABLE 5.2 MAIN EFFECTS AND INTERACTIONS EXPRESSED IN TERMS OF 
INDIVIDUAL TREATMENT MEANS: 23 FACTORIAL 


Treatment combination 


do Divisor 


| 
| 


5 
Е LE 
+++ 
жізіітіз 
Таж +++ 
tiiti 
LI+I+I++ 
ЕТ 
++++++++ 
н> н> > и аны о 


factorial effects in terms of the treatment means, while the columns give 
the treatment means in terms of the factorial effects. For example, 


@=M+#A-B~C- AB — AC + BC + ABC] 
()-() = А — AB — AC + ABC 
(abe) — (1) = A+B+C+ ABC 


As before, the factor appears with all terms except M in the expression 
for a treatment mean. The difference between two treatment means, of 
which two examples are given above, is easily found by noting the signs 
in the two columns іп question. Я 

As the reader may verify, the 7 factorial effects are mutually orthog- 
onal, and each is orthogonal to М. Tfr is the number of replicates, each 
factorial effect has the same variance, a: The contribution of 

r 

any effect to the sum of squares for treatments is [ ]?/8r, where [ 1 
denotes the factorial effect total; €g., 


[4] = — [1] + [a] — [b] + [ab] ~ [c] + [ac] — [Бе] + [abc] 
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5.24 The 2” Factorial Experiment. A few formulae will be given 
which apply to any number of factors, all at two levels. With n fac- 
tors, there are n main effects, n(n — 1)/2 two-factor interactions, 
та — 1)(n — 2)/6 three-factor interactions, and so оп. The successive 
numbers are the coefficients in the expansion of (1 + 1)", omitting the 
first coefficient, unity. 

Two equivalent methods for writing out any factorial effect in terms 
of the original treatment means are available. Each is simple to re- 
member, though the actual writing may take some time in a large ex- 
periment. They will be illustrated by finding the BCDE interaction in 
а 2° experiment, with factors A, B, C, D, E, and F. 


Rule 1. Evens versus odds. In every factorial effect, half the treatment 
combinations receive a + sign and half a — sign. Those which receive 
one sign are those which contain an even number of the letters that ap- 
pear in the factorial effect. In the BCDE interaction 4 letters, b, с, d, 
ande, appear. There are 8 combinations that contain an even number of 
letters: (1), be, bd, be, са, се, de, and Басе. Each of these can be combined 
with any one of 4 combinations of the remaining letters a and f, namely, 
(1), a, f, and af. The 32 terms appear below. Since the terms contain 
abcdef, they receive by convention a + sign. 


(1) а f af 

be abe bef abef 
bd abd bdf abdf 
be abe bef abef 
са аса cdf acdf 
ce ace cef асе} 
de ade adef 


def 
bode abede Ый abcdef 


The 32 combinations with a — sign are those which have an odd num- 
ber of the letters b, с, d, or е. In detail, these are 


b ab bf abf 
с ас of acf 
d ad df adf 
e ae ef aef 
bed abcd bedf аЬ 
bee «аке beef «од 
bde abde bdef аме 
cde ай са) ау 


When calculated from the treatment means, this difference between the 
sums for the two groups of 32 combinations is divided by 2"~* to give the 
BCDE effect. 
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Rule 2. Algebraic. In this rule the BCDE interaction is expressed 


formally as 

BCDE = = (a+ 1)@ — 1)(с- 1)(d — 1)(e — D -1) 
Note that — signs appear with the factors that enter into the interaction 
and -- signs with those that do not. If this expression is expanded al- 
gebraically, it gives the interaction as a linear function of the treatment 
means. 

As in the 2? and 2? cases, all factorial effects are orthogonal to one an- 
other and to the mean. The contribution of any effect to the sum of 
Squares in the analysis of variance is [ P/2"r, where [ ] denotes the 
effect total. 


5.24a Yates’ Method of Computing Factorial Effect Totals. Yates 
(5.3) gives an automatic method of computing all the factorial effect 
totals without writing down the algebraic expressions for the factorial 
effects. If all or most of the factorial effects are wanted for study, 
Yates’ method is the most expeditious. If only a minority of the fac- 
torial effects are of interest, e.g., the main effects and two-factor inter- 
actions in a 2° factorial, a separate calculation of these effects may save 
time. 


TABLE 5.1а APPLICATION оғ YATES’ METHOD TO А 2! FACTORIAL т = MANURE, 
% = NITROGEN, р = PHOSPHORUS, k = POTASSIUM 
Yields per plot (total over 6 harvests, km. per 3-meter row) 
Rep.1  Rep.2 Rep.3 Кер. 4 Total 


(1) 32 43 27 19 121 
т 47 41 48 45 181 

n 26 36 24 18 104 
mn 61 76 56 64 257 
Р 29 39 27 28 123 
тр 51 34 40 48 173 
пр 36 31 32 30 129 
тпр 76 65 70 63 274 
k 35 42 56 35 168 
mk 63 41 60 53 217 
nk 80 68 75 67 290 
mnk 100 68 87 66 321 
pk 40 44 53 36 173 
mpk 64 39 75 72 250 
npk 105 99 74 73 351 
mnpk 90 82 89 101 362 


Total 935 848 893 818 3494 
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TABLE 5.1a (Continued) 


Yates’ method 
Treatment Effect Effect 
totals total mean 
а) @ (3) (4) 
а) 121 302 663 192 344 . G 54.6 
т 181 361 699 2132 576 М 18.0 ** 
т 104 296 996 408 682 М 21.3 ** 
mn 257 403 1136 168 104 MN 3.2 
? 123 385 213 166 176 Р 5.5 * 
тр 173 611 195 516 -10 МР —0.3 
пр 129 423 80 188 112 МР 3.5 
mnp 274 713 88 —84 —46 ММР -14 
Total 3494 4070 4856 5088 
k 168 60 59 36 770 K 24.1 ** 
mk 217 153 107 140 —240 мк —7.5 ** 
nk 290 50 226 -18 350 NK 10.9 ** 
mnk 321 145 290 8 —272 MNK —8.5 ** 
pk 173 49 93 48 104 РК 3.2 
mpk 250 31 95 64 26 MPK 0.8 
npk 351 77 —18 2 16 МРК 0.5 
mnpk 362 11 —66 —48 —50 ММРК -16 
Odds 1459 1642 2312 2192 5704 
Evens 2035 2428 2544 2896 88 


Total 3494 4070 4856 5088 5792 

** Double asterisk denotes significance at the 1% level. 

* Single asterisk denotes significance at the 5% level. 

The upper half of table 5.1a shows the yields of a 2* experiment, in 
four randomized blocks, on the effects of fertilizers on grass. In the 
lower half, Yates’ method is applied to the treatment totals. The steps 


are as follows: 
1. Write down the treatment combinations in the systematic order 


shown. The letters m, n, р, k are introduced in turn. The introduction 
of any letter is followed by its combinations with all previous treatment 
combinations. Thus p is followed by mp, "p, and mnp in turn. Place 
the corresponding treatment totals in the next column. 
2. In the top half of column (1), add the yields in pairs. 
302 = 121 + 181; . 361 = 104 + 257; ete. 
In the lower half of column (1), subtract the first member of each pair 
from the second. 
60 — 181 — 121; 153 = 257 — 104; ей. 
3. Apply the same process to column (1) obtaining column (2); then 
to column (2) obtaining column (3); and finally to col'imn (3), obtaining 
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column (4), which contains the factorial effect totals. The first number 
in column (4) is the grand total @. The succeeding factorial effects come 
out in the order in which the treatment combinations were written down. 
With n factors, the process is continued until we reach column (n). То 
obtain the factorial effect means, divide by 27-19, or in this case 32. 
Gis divided by 27, or 64, to give the mean yield of the whole experiment. 

Checks. The sum of squares of the factorial effect totals, divided by 
2"r, should equal the treatments sum of squares, computed in the usual 
way. This check verifies the whole process except for errors in the signs 
of the factorial effect totals and errors in labeling the factors. 

A series of intermediate checks is recommended for the larger fac- 
torials. In the “Treatment totals" column, add separately the odd 
numbered and the even numbered totals, i.e. 


Odds: 121 + 104 + 123 +---+ 351 = 1459 
Evens: 181 + 257 + 173 + ---- 362 = 2035 


as shown at the foot of the column in table 5.1а. 

Do the same in column (1), and also add the top half of this column. 
The latter total, 3494, should agree with the total 3494 of the previous 
column. A second check on column (1) is provided by the relation 


4070 — 3494 = 2035 — 1459 


These two relations constitute a satisfactory check on the values in 
column (1). In the same way, column (2) is checked from column (1) 
and so оп. 


TABLE 5.2a ANALYSIS OF VARIANCE 


d.f. 8.8. m.s. 
Replications 3 493.3 
Treatments 15 26,791.9 1,786.1 
Error 45 4,074.2 90.5 
Total 63 31,359.4 


In the analysis of variance (table 5.2a), it is unnecessary to divide the 
treatments sum of squares into single d.f., since tests of significance can 
be applied directly to the factorial effect totals or means, If s? is the 
error variance per unit, the standard errors for factorial effect totals and 
means are as follows: 


Effect totals: ве. = V 2"rs? = V/(64)(90.5) = 76.1 
Effect means: в.е. = Vs?/2"~r = V (90.5) /16 = 2.38 
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For 45 d.f., the 5% and 1% t-values are 2.014 and 2.690, respectively. 
A factorial effect mean must exceed (2.38)(2.014) = 4.79 for 5% sig- 
nificance, and similarly 6.40 for 1% significance. All main effects, and 
the interactions MK, NK, ММК are significant. The relationships 
among the effects of M, N, and K are interesting to examine from the 
three-way table of means for these factors. 

The alternative method of computing factorial effect totals, recom- 
mended when it is not desired to isolate the high-order interactions, is 
illustrated later in section 6.14. The first step is to write down the + 
and — signs for the main effects, M, N, P, and K. These are the only 
expressions that need be remembered. The expression for MN is found 
by multiplying corresponding signs in M and N. Similarly MNP is 
given by the product of corresponding signs in MN and P, and so on. 


5.25 Factors at More than Two Levels: а 4 Х 2 Factorial. Various 
notations are used. For example, the 3 levels of a factor А. шау be de- 
noted by ao, ал, аз, or by ал, аз, аз, or simply by the numbers 0, 1, 2. For 
illustration, we give’ below the treatment totals in a 4 X 2 experiment 
on sugar сапе. The treatments were 4 levels of dressing of potash, ko, 
kı, Ко, and kg, in arithmetic progression, and 2 levels of phosphate, po 
and p,. There were 5 replicates in randomized blocks. 


TABLE 5.8 TOTAL YIELDS OF SUGAR CANE (TONS PER ACRE) 
ko ky ks ks Total 


Po 180 248 277 285 | 990 
n 251 307 342 346 | 1246 


Totals | 431 555 619 631 2236 


The main effect of phosphate (P), which occurs at only 2 levels, is of 
course derived from the comparison of the 2 marginal totals, 990 and 
1246. For the main effect of potash (К) there are 4 marginal totals 
which may be compared. It will be recalled (section 3.42) that 3 in- 
dependent comparisons may be made amongst 4 totals, and that an in- 
finite number of such sets of 3 may be chosen. Thus the main effect of 
K comprises 3 independent comparisons. 

With the 2" system, we were able to define every main effect as a spe- 
cific linear combination of the treatment means. In the present case we 
could select a particular set of 3 independent comparisons, each of which 
would be a specific linear combination of the treatment means. These 
could be defined as the “components” of the main effect of K. However, 
the particular set that is most useful for the interpretation of the results 
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will change from experiment to experiment, so that a formal definition 
of this type would be of limited utility. The experimenter should use 
whichever set appears most relevant. 

Now consider the interaction between P and K. From the differences 
between the two rows in table 5.3, the effect of P is estimated separately 
at each level of K, the estimates being 


ko kı ka kg 
(ру — Po) 71 59 65 61 


Any comparison among these 4 figures is a measure of the effect of К on 
the response to P, and therefore is a component of the interaction be- 
tween P and K. Consequently the interaction between P and K con- 
sists of 3 independent comparisons, and is said to have 3 d.f. As with 
the main effect of K, the particular set of 3 components that will be of 
interest varies with the type of experiment. 

We may also wish to consider the interaction as the effect of P on the 
response to K. As we have seen, the response to K has 3 components. 
Since there are 4 increasing levels of potash dressing, we might choose as 
components the linear, quadratic, and cubic components of the response 
curve. Apart from a divisor, the linear component is 

—3(180) — 1(248) + 1(277) + 3(285) = 344 
at the lower level of P and 

—3(251) — 1(307) + 1(342) + 3(346) = 320 
at the higher level of P. The difference between these 2 figures, —24, 
estimates the effect of P on the linear response to K, and is a part of the 
interaction between K and P. The other two components are the ef- 
fects of P on the quadratic and cubic responses to K. 

In a 2” system, the interaction between P and K is identical with the 
interaction between K and P. In the more general case the correspond- 
ing result is that any component of the interaction between K and P can 
be derived from the components of the interaction between P and K. 
For example, the effect of P on the linear response to K can be written 


—24 = —3(71) — 1(59) + 1(65) + 3(61) 
so that it is a linear function of the responses to Р at the 4 levels of К. 
In this more general sense, the two interactions are still equivalent. 
We will consider the analysis of variance of the 8 treatment totals in 
some detail. With 5 replicates, the total s.s., on a single-plot basis, is 
(180)? + (251)? + (248)? + ...+ (346)? (2236)? 
40 


5 = 4165.2 
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The 7 d.f. subdivide into 1 for the main effect of P, 3 for that of K, and 
3 for the PK interaction. In practice the first two terms are calculated 
directly and the interaction obtained by subtraction. The computations 
are 
(1246 — 990)? 
SSS 


40 
" (431)? + (555)? + (619)? + (631)? (2236)? 
j 10 T 


= 1638.4 


= 2518.4 


TABLE 5.4 PRELIMINARY ANALYSIS OF VARIANCE OF DATA IN TABLE 5.3 


d.f. 8.8. m.s. 
P 1 1638.4 1638.4 
K 3 2518.4 839.5 
PK 3 8.4 2.8 
Total 7 4165.2 


As is common in agricultural experiments, the interaction m.s. is small 
compared, with those for the main effects (in fact it is below the mean 
square for error). 

Аз an exercise we will divide the К main effect and its interaction with 
P into linear, quadratic, and cubic components. The first step is to cal- 
culate the totals for these effects separately at each level of P. The re- 
sults are shown below, the multipliers for the 4 levels of K being shown in 
parentheses. These multipliers are obtained from a table of orthogonal 
polynomials (5.9). 


Ki Ка к, 
(—3, —1, +1, +3) (+1, —1, -1, +1) (-1, +3, —3, +1) 
Po +344 — 60 +18 
pı +320 — 52 -10 
Sum (рі + ро) +664 —112 +8 
Diff. (py — po) — 24 + 8 —28 


From the swm line, we obtain the 3 components of the K main effect, 
Kı = 51664)? = 2204.5, К, = 4 (112)* = 313.6 
К. = = (8)? = 03 
Similarly, from the difference line, the components of the PK interaction 
are 
PK, = 349024? = 2.9, РК, = 458): = 1.6 
РК. = gs 28? = 3.9 
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The divisors are found from the usual rule for lirear functions (section 
3.42). The three K components add to 2518.4, the total s.s. for K, while 
the PK components add to 8.4, the sum of squares for PK. The detailed 
analysis of variance is shown in table 5.5. 


TABLE 5.5 Мове DETAILED ANALYSIS OF VARIANCE OF THE DATA 


IN TABLE 5.3 
d.f. 8.8. OF M.S. 
n 1 1638.4 
Ki 1 2204.5 
K, 1 313.6 
Ke 1 0.3 
РК, 1 2.9 
РК, 1 16 
PK. 1 3.9 


Since every component has 1 d.f., the mean squares are the same as 
the sums of squares. The error m.s. is 16.35, with 28 d.f. The quadratic 
component of К is significant, indicating a falling off in the response at 
the higher levels of application. Neither the cubic component nor any 
of the interactions approaches significance. 


5.26 А 3? Factorial Experiment. As a second example, the treatment 
totals are shown in table 5.6 for an experiment with 3 levels of nitrogen 
fertilizer and 3 of phosphate fertilizer. "The data are the numbers of 
lettuce plants that emerged from the ground.and are totals over 12 plots 


TABLE 5.6 NUMBERS OF LETTUCE PLANTS EMERGING * 


no т пә Totals 


Po | 449 (Ао) 413 (СВ) 326 (By) | 1188 
Pı 409 (B8) 358 (Ay) 291 (Ca) | 1058 
pa | 941(Cy) 278 (Вә) 312 (АВ) | 931 


Totals| 1199 1049 929 3177 
* The use of the latin and greek letters will be explained later. 


each. Both nitrogen and phosphate appear to have had a deleterious 
effect on emergence (the subscript 2 denotes the largest application). 
The main effects of N and P both comprize two independent com- 
parisons, and thus have 2 d.f. each. Since the amounts of N and P were 
in arithmetic progression, it would probably again be appropriate, as in 
the previous example, to choose the linear and quadratic components of 
the regression on amount of dressing as the individual components of the 
main effects. 
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Тһе N; component can be estimated separately at each level of P. 
This means that there are 2 d.f. which measure the effect of P on the 
linear response to №. These are a part of the NP interaction. Another 
2 d.f. are supplied by the effect of P on the №, component, so that the 
NP interaction contains 4 d.f. 

From inspection of the margins of table 5.6, it is evident that the main 
effects of both N and P are approximately linear. Consequently, the 
most interesting single degree of freedom from the interactions is likely 
to be the interaction of N; with P;. In many agricultural experiments 
this type of interaction is the only one that approaches significance. It 
is worth while calculating this interaction separately in the analysis of 
variance. The other three components, N;P,, М.Р), and М.Р, will also 
be obtained. ‘The initial computations appear in table 5.7. 


TABLE 5.7 CALCULATION OF LINEAR AND QUADRATIC EFFECTS FOR THE 
ANALYSIS OF VARIANCE 


Ni Ne Pi Py 
(-1, 0, 1) а, -2,1) (-1,0, 1) (1, -2,1) 
ро —123 — 51 no —108 —28 
n —118 — 16 n —135 —25 
р» — 29 + 97 та - 14 +56 
Sum —270(N1) + 30(№.) Sum -257(Р) + 3(P,) 
Р, + 94(МР) — -+148(М.Р) № + (Р.Х) T84P4N) 
P, + 84(МРа) + 78(№Ро) № +148(PiN S) +78 (РМ) 


The left side of the table shows е N; апа М, effects for each level of 
P, while the right side shows the P; and P; effects for each level of N. 
For instance, from table 5.6, 


Nip, = 326 — 449 = —123; Ру = 449 — 2(409) + 341 = —28 


The column sums give the individual components of the № and P main 
effects. Now consider the difference between the third and the first row. 
For the М; column, this difference (+94) gives the linear effect of P on 
N: i.e., the М.Р; interaction. From the P; column, the difference gives 
the linear effect of N on Р), or the P;N; interaction, which is exactly the 
same as the N;P; interaction. The other two columns provide the N,P; 
and the N;P, effects. 

Finally, the sum of the first and third rows, minus twice the second 
row, leads to the components of interaction that contain a quadratic 
term. It will be seen that all four components can be obtained from 
either the left or the right half of the table, so that in practice only one 
half is required. By computing both halves we verify the symmetry of 
the components with respect to № and P. 
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The squares of these quantities, with appropriate divisors, will give an 
analysis of variance of the 8 d.f. amongst the treatment totals into 8 
single components. Since an individual entry in table 5.6 is the total 
of 12 plots, the divisors may be verified to be as shown below. 


МогРр NgorPqg МР МІР, М.Р; NP; 


Divisor 72 216 48 144 144 432 


For instance, the divisor for NP, may be worked out as follows. The 
three N, figures in table 5.7 (i.e, —51, —16, +97) each have divisor 
12(12 + 2? + 12), or 72. Since the N,P, total is a linear function of 
these three figures, with coefficients 1, —2, and 1, this total has divisor 
72(12 + 22 + 12), or 432. The analysis of variance is shown in table 5.8. 


TABLE 5.8 SUBDIVISION OF THE TREATMENT 8.5. 


d.f. 8.8. OT M.S. 
Ni 1 1012.50 
№ 1 4.17 
Ві 1 917.35 
Py 1 0.04 
МР, 1 184.08 
МР, 1 49.00 
МР! 1 152.11 
NP, 1 14.08 


The error m.s. in this experiment is about 59. The linear effects of both 
fertilizers are significant, with no indication of curvature. The N:P, ef- 
fect is significant at the 10% level, but not at the 5% level. 

In table 5.6, latin and greek letters were superimposed so as to 
form a3 X 3 graeco-latin square. This square leads to another method 
of calculating the sum of squares for the interactions (4 d.f.). Although 
the method is not likely to be of use for purposes of interpretation, it has 
formed the basis of some ingenious devices in the construction of de- 
signs. In the square, the column totals represent the main effects of N 
and the row totals those of P. Since the latin letter totals are orthogonal 
to rows and columns, it seems reasonable to suppose that they must 
represent two of the 4 components of the МР interaction. Similarly. 
the greek letter totals represent the remaining 2 components. These 
totals are shown below. 


A B с Total а В Y Total 
1119 1913 1045 317 108 1134 1025 3177 
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Each figure is now a total of 36 plots. The sum of squares of deviations 
of the latin letter totals, divided by 36, is 164.22, and the corresponding 
figure for the greek letters is 235.06. These add to 399.28, which is the 
same as the total s.s. for the interactions in table 5.8, apart from round- 
ing differences. 


5.27 General Method of Analysis. Suppose that there are three fac- 
tors, А, В, C, which occur at о, В, and у levels respectively. The main 
effects have (a — 1), (8 — 1), and (у — 1) components or degrees of 
freedom respectively. Each component of the main effect of A can be esti- 
mated separately at each of the В levels of B. Thus each component of A 
contributes (8 — 1) degrees of freedom to the AB interaction. This means 
that the AB interaction contains a total of (о — 1)(8 — 1) components 
or degrees of freedom. Similarly the AC interaction has (æ — 1)(у — 1) 
degrees of freedom, and the BC interaction has (8 — 1)(y — 1) degrees 
of freedom. 

To compute the sums of squares for the main effects and first-order 
interactions, we form two-way tables for each pair of factors. Consider 
the A by B two-way table. The total s.s. among cells has (a8 — 1) de- 
grees of freedom. From the marginal totals in the table we compute 
the sum of squares for the main effect of A, with (о — 1) degrees of free- 
dom, and that for the main effect of B, with (8 — 1) degrees of free- 
dom. By subtraction, the sum of squares for the АВ interaction, with 
(a — 1)(8 — 1) degrees of freedom, is obtained. 

There remains the three-factor, or ABC, interaction. Now each of 
the (a — 1)(8 — 1) components in the AB interaction is estimated sep- 
arately at each level of С. It will therefore contribute (y — 1) degrees 
of freedom to the ABC interaction, so that the latter contains in all 
(a — 1)(8 — 1)(y — 1) degrees of freedom. The sum of squares for ABC 
is also obtained most easily by subtraction. From the total s.s. amongst 
treatments, with (ay — 1) degrees of freedom, subtract the sums of 
squares for A, B, C, AB, AC, and BC. The remainder will be the sum of 
Squares for ABC. 

It is hoped that the reader will find no difficulty in extending these 
methods to the case where there are more than three factors. In gen- 
eral, the sums of squares for main effects are calculated directly, and those 
for interactions are calculated by subtraction. To compute an ABCD 
interaction, for instance, we require a four-way table for the four factors 
represented. From the sum of squares for this table we subtract the sum 
of squares for all main effects and two- and three-factor interactions 
among the factors in question. Calculations must be checked by re- 
computation. Of course, if the experimenter subdivides any interaction 
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into single components as їп the previous section, a check is provided by 
this process. 

In the general case it remains true that any component of a factorial 
effect is orthogonal to any component of any other factorial effect. Thus 
any component of the BC interaction з orthogonal to any component of 
the A main effect, or of the BCD interaction, etc. Two components of 
the same factorial effect may or may not be independent. For instance, 
if A occurs at three levels, the comparison (az — ao) is independent of the 
comparison (аҙ — 2a; + ao), but is not independent of (а — ao). 

The following selected references contain a discussion of factorial ex- 
periments with some worked examples. 


5.8 Yares, Е. The design and analysis of factorial experiments. Imp. Bur. Soil 
Sci. Tech. Comm. 35, 1937. This gives the most comprehensive account that 
is available, with numerous worked examples. 

5.4 Yares, Е. Complex experiments. Jour. Roy. Stat. Soc. Suppl. 2, 181-247, 
1935. Ап earlier reference, with examples of 2°, 23, 4 X 3, and 3° factorials. 

5.5 Davies, О. L. (Ed.). Design and analysis of industrial experiments. Hafner 
Со., New York, 1954. An excellent account of the industrial applications of 
factorial designs. 

5.6 Linguist, E. Е. Design and analysis of experiments in psychology and edu- 
cation. Houghton Mifflin Co., Boston, 1953. Contains a detailed discussion 
of the use and interpretation of factorial experiments in psychology and edu- 
cation. 

5.7 Kempruorne, O. The design and analysis of experiments. John Wiley & 
Sons, New York, 1952. Covers about the same ground as this book, with 
more emphasis on the theory underlying the analysis and on methods of con- 
structing designs. 


5.28 Interpretation of the Analysis: First Example. The separation 
of the treatment comparisons into main effects and interactions is a con- 
venient and powerful method of analysis in cases where interactions are 
small relative to main effects. When interactions are large, this analysis 
must be supplemented by a detailed examination of the nature of the 
interactions. It may, in fact, be found that an analysis into main effects 
and interactions is not suited to the data at hand. There is sometimes а 
tendency to apply the factorial method of analysis mechanically without 
considering whether it is suitable or not, and also a tendency to rely too 
much on the initial analysis of variance alone when writing a summary of 
the results. Below two examples are presented where the initial analysis 
of variance is not very informative, and where the results can be sum- 
marized better in terms of simple rather than factorial effects. 

The first experiment was conducted by the Wailuku Sugar Company. 
Three varieties of sugar cane were compared, in combination with three 
levels of nitrogen (150, 210, and 270 lb. N per acre respectively). The 
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crop was the second harvesting, or the first ratoon crop. In table 5.9 
only the relevant part of the analysis of variance is shown: i.e., the sum 
of squares for the main effects of V and N and for the VN interaction, 
plus the error в.в. The data are іп tons of cane per acre. The conclusions 


TABLE 5.9 ANALYSIS OF VARIANCE OF A 3 X 3 SUGAR CANE EXPERIMENT 


d.f. 8.8. m.s. 
y 2 319.38 159.69 * 
N 2 56.54 28.27 
VN 4 559.79 139.95 * 
Error 24 1053.84 43.91 


from the analysis of variance are that the main effects of V and the УМ 
interaction are both significant, but there is no sign of а main effect due 
to N. This statement tells little about the results of the experiment. 

'The treatment totals (over 4 replications) are shown below with their 
standard errors. Since the standard error per plot is V 43.91 = 6.626, 
the standard error for а, treatment total is 13.3 as shown. 


TABLE 5.10 TREATMENT TOTALS (TONS) IN А 3 X 3 SUGAR CANE EXPERIMENT 


(13.3) 
no т m Total в.е, 
v 266.1 275.9 303.8 845.8 
т 245.8 250.2 281.7 777.7 +23.0 
% 214.4 258.1 231.6 764.1 
Total 786.3 784.2 817.1 2387.6 
ве. +23.0 


Instead of having no effect, nitrogen has apparently given a steady in- 
crease in yields with the first two varieties, but a steady decrease with the 
third variety. Further, the significant main effects of varieties apply 
only to the average varietal yields over all 3 dressings of №, and not to 
yields with a particular rate of dressing. On the average v; gives a sub- 
stantially higher yield than vs, but at the lowest level of М, 0 is slightly 
above о. 

The subsequent analysis may be made either by means of ¢-tests ap- 
plied to table 5.10 or by means of a further subdivision of the analysis of 
variance. It is of interest to examine the response to N for each variety 
separately. To test the linear responses by means of a t-test, we require 
the standard error for (лә — no) as computed for each variety. This 
standard error is 13.3 X 1.414, or 18.8, and since the 5% t-value for 24 
d.f. is 2.064, the quantities (ng — no) must attain the value 18.8 X 2.064, 
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or 38.8, in order to be significant at the 5% level. The actual values are 
(na — пом = +37.7; (п — ту) = +35.9; 

(nz — поз = —42.8 


Thus neither of the increases with v; and v» quite reaches the significance 
level, though both are close to it. The decrease with v3 is significant. 

The test of linearity of the response curves is made by means of the 
quantity (na + по — 2п1), calculated for each variety. The standard 
error of this quantity is 13.3 X +/6, or 32.6. The reader may verify 
from table 5.10 that none of the curvature terms even exceeds its stand- 
ard error. 

It is not quite so clear what tests are appropriate for appraising the 
varietal differences. Since, however, the difference between v; and vs is 
very consistent at all levels of М, a t-test of the total difference is sug- 
gested. This difference is 68.1, and since the value required for signifi- 
cance is 23.0 X 1.414 X 2.064, or 67.1, the superiority of v; over v2 is 
just significant. Further, it is evident on inspection that vg does not dif- 
fer significantly from the other varieties at either of the two lower levels 
of N. At the highest level, v; is significantly below both v; and vs. 

When interactions are large, much care is required in the preparation 
of a statement that summarizes the results, and it is not easy to reach a 
form that is free from criticism. The following is a suggestion. 

“The increase in yield of cane to the highest dressing of N (270 Ib. per 
acre) over the lowest dressing (150 Ib. per acre) was 9.4 tons per acre with 
v; and 9.0 tons per acre with v2, Both increases just failed to be signifi- 
cant at the 5% level. With оз, on the other hand, the highest dressing 
of N decreased the yield significantly by 10.7 tons per acre as compared 
with the lowest dressing. For all three varieties the effects of N appeared 
to be proportional to the amount applied, within the range investigated. 

“Variety 1 gave a higher yield than variety 2 for all levels of N, the 
average difference, 5.7 tons, being just significant at the 5% level. The 
yields for variety 3 did not differ significantly from those of the other 
varieties at the two lower levels of №. At the highest level of №, the yield 
for variety 3 was significantly lower than that for the other varieties.” * 


5.29 Interpretation of the Analysis: Second Example. This example 
is more complex, mainly because the factorial (a 3 X 3) is of an unusual 
type. The data come from a long-term experiment on meadow hay, 

* The explanation for the harmful effect of N with v; is that this variety ripened 
earlier than the others. By the time the experiment was harvested, much cane 
һай fallen to the ground through overripeness on the plots with рз that received М. 


Had the variety been harvested at the optimum time, results would have been 
different. 
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conducted at Lady Manner's School, Bakewell, England, with the co- 
operation of Rothamsted Experimental Station. The yields are for the 
1937 season. The two factors are shown schematically below. 


First factor (3 levels) Second factor (3 levels) 
No manure шаі Хо manure " 
Mixed artificials S ш Алей actificiala ы m 
8 tons compost 8 tons compost 


The nature of the experiment may become clear from a discussion of some 
of the individual treatment combinations. In any replicate there are 
nine plots, of which three received mixed artificial fertilizers in 1937. 
In the previous year, 1936, one of these three plots received no manure, 
one received mixed artificials, and one received compost. Thus these 
three plots enable us to compare the residual effects of the 1936 appli- 
cations of artificials and compost, on plots which received artificials in 
the current year, 1937. Similarly, from other treatment combinations, 
we can compare the residual effects of artificials and compost on plots 
which were unmanured in the current year and on plots which received 
compost in the current year. 

Further, in any replicate there are three plots which were unmanured 
in 1936. Of these, one had no manure, one had artificials, and one had 
compost in 1937. Consequently we may also assess the direct effects of 
artificials and compost applied in 1937 on plots which were unmanured 
in the previous year, and likewise on plots which received artificials or 
compost in the previous year. Both the artificials and compost contain 
the three common plant nutrients, nitrogen, phosphorus, and potash. 

The system of treatments is an ingenious one, designed {0 measure 
direct and residual effects at the same time. The experiment was started 
in 1932, but for simplicity we will ignore any effects of treatments applied 
prior to 1936. 

The treatment means and the analysis of variance are shown in tables 
5.11 and 5.12. The analysis of variance was calculated from the treat- 
ment means, rather than from single plots, and the error m.s. has been 
adjusted so as to apply to a treatment mean. The direct effects are 
highly significant, the interactions are significant at the 5% level, but 
residual effects are not significant. In considering the nature of the ef- 
fects leading to these results, it is again convenient to think in terms of 
simple effects rather than of main effects and interactions. It is worth 
noting that the standard error of a single entry in the two-way table is 
V 6.656, or 2.58, while that for the difference between two entries is 
3.65. 
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With regard to residual effects, table 5.11 shows that compost pro. 
duced a large and highly significant increase, 13.4 cwt. + 3.65, on the 
plots that received no manure in the current year. On plots that re- 
ceived manure in 1937, either artificials or compost, there is no suggestion 
of any residual effect of compost. None of the residual effects of arti- 
ficials approaches the level required for significance. These results are 


TABLE 5.11 TREATMENT MEAN YIELDS (HAY, CWT. PER ACRE) (+2.58) 


Residual effects of 1936 treatments 
Direct effects of ----------с--- 
1937 treatments None Artificials Compost Mean 


None 53.6 56.8 67.0 59.1 
Artificials 80.8 82.3 80.5 81.2 
Compost 74.3 69.1 70.0 71.1 
Mean 69.6 69.4 72.5 70.5 


TABLE 512 ANALYSIS OF VARIANCE OF TREATMENT MEANS 


d.f. 8.8. m.s. 
Direct effects 2 732.28 366.14 ** 
Residual effects 2 18.24 9.12 
Interaction 4 97.01 24.25 * 
Error 24 159.74 6.656 


in accord with general fertilizer experience, since a compost is more 
likely to give residual effects than an inorganic fertilizer, and since re- 
sidual effects would be expected to show up most clearly on plots which 
have no manures during the current year, 

So far as direct effects are concerned, artificials were superior to com- 
post whatever the residual treatment. 


1936 manuring 1987 artificials — 1937 compost (cwt.) 


No manure +6.5 
Artificials +13.2 
Compost +10.5 


The differences among these three figures are within the limits of ex- 
perimental error. To test this point, we find the sum of squares of 
deviations of these differences, which comes to 22.73. This is divided by 
2 to make it comparable with the analysis of variance, giving 11.36, 
which represents two of the 4 d.f. for interactions, The mean square, 
5.68, is slightly below the error m.s. Thus the superiority of artificials 
appears to be consistent. 
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The increases for artificials and compost over no manure are much re- 
duced on plots that received compost in the previous year. 


1936 manuring (1937 artificials — 1937 попе) (1987 compost — 1937 none) 


Compost +13.5 +3.0 
Artificials and none +26.4 +16.5 
(averaged) 


The striking reduction on plots with 1936 compost is simply a reflection 
of the nature of the residual effect of compost as previously examined. 

The conclusions from the experiment might be summarized as follows. 

“Artificials applied in 1937 increased yields by 13.5 ewt. on plots that 
received compost in the previous year, and by 26.4 cwt. on plots that did 
not receive compost in the previous year. The corresponding increases 
due to 1937 compost were 3.0 cwt. and 16.5 cwt., respectively. The su- 
periority of 1937 artificials over 1937 compost, which averaged 10.1 
cwt., appeared to be independent of the type of manuring during the 
previous year. 

“As regards residual effects, compost applied in 1986 increased yields 
by 13.4 ewt. on plots that were unmanured in 1937, but gave no apparent 
increase on plots manured in 1937. There were no significant residual 
effects of 1936 artificials.” 

It seems evident that the significant interaction m.s. in the analysis 
of variance must arise mainly from the fact that compost had a residual 
effect only when no manure was applied currently. As an exercise it may 
be instructive to isolate the part of the interaction s.s. that is due to this 
effect. Since artificials appeared to have no residual effect, we will com- 
bine the 1936 artificials with the 1936 unmanured plots. Consequently, 
the residual effects of compost are obtained from the comparison 


2(1936 compost) — (1936 artificials) — (1936 попе) 


This quantity may be calculated separately for each of the 1937 treat- 
ments. Its values are 


(1937 none) = +23.6; (1937 artificials) = —2.1; 
(1937 compost) = —3.4 


The contrast between the residual effect of compost on plots without 
1937 manures and that on plots with 1937 manures may be estimated 
from the comparison 


2(+23.6) — (—2.1) — (—3.4) = +52.7 
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The square of this quantity, with a suitable divisor, is a single com- 
ponent of the interaction s.s. Since the divisor for the quantities 23.6, 
etc., is 6, the divisor needed is 6 X 6, or 36. Hence the sum of squares 
for this component is (52.7)?/36, or 77.15. The remaining 3 d.f. for in- 
teractions have a sum of squares equal to 19.86. The mean square, 
6.62, is no larger than the error m.s. This verifies the suggestion that 
the significance of the interaction m.s. can be attributed to the type of 
residual effect of compost. 

Some readers may have difficulty in satisfying themselves that the 
component isolated above really is a part of the interaction s.s. It may 
be helpful to express the basic quantity, 52.7, as a linear function of the 
original mean yields in table 5.11. The multipliers of the means will be 
found to be as shown below. 


1937 treatments 1936 treatments 
None Artificials | Compost 


None -2 -2 4 
Artificials 1 1 —2 
Compost 1 1 -2 


The sums of the coefficients are zero over every row and column so that 
the expression is orthogonal to both sets of main effects. Hence it must 
be a part of the interaction. The coefficients also enable us to verify the 
divisor, 36, which is equal to the sum of the squares of the coefficients. 

Since the interactions can be attributed to the behavior of a single 
group of plots (those with compost in 1936 and no manuring in 1937), it 
might be suspected that this treatment had been allotted by chance a 
favorable set of plots. The residual increase, 13.4 cwt., does seem rather 
large in relation to the direct effect of compost. An examination of 
previous results does not lend much weight to this suspicion. Over the 
4 preceding years, the average residual response to compost, with no 
current manure, was 11.1 cwt. In 2 years, 1935 and 1933, the plots in- 
volved in this comparison were the same as those in 1937. For these 
years the average residual effect was 6.2 cwt. In the other 2 years, when 
a different set of compost plots is involved, the average residual effect 
was 16.0 cwt. 

The two preceding examples are intended to illustrate the fact that 
the most informative subdivision of the treatment comparisons depends 
on the type of experiment. An analysis copied from a model that ap- 
pears similar in form may be inappropriate or even meaningless. The 
experimenter should first decide which comparisons are necessary for the 
interpretation of the results. The subsequent analysis, either by ¢-tests 
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or by subdivision of the analysis of variance, should be directed towards 
these comparisons. 

When all the factors represent continuous variables like temperature, 
time, or the amount of an added substance, it may be natural to think 
of the response as a mathematical function of the levels of the factors. 
The type of analysis needed to fit polynomial functions of the first and 
second degree is outlined in chapter 8A. 


5.3 Designs for Factorial Experiments 


5.31 Factorials in Complete Block Designs. Most types of experi- 
mental plan are suitable for factorial experiments. In particular, if the 
total number of treatment combinations is not large, the designs de- 
scribed in chapter 4 are frequently used. The relative advantages of 
complete randomization, randomized blocks, and latin squares are the 
same with factorial as with non-factorial sets of treatments. For il- 
lustration, we show arrangements for a 4 X 2 factorial (i) in 8 random- 
ized blocks and (ii) in an 8 X 8 latin square. In the former case, only 
the first replicate is given. 


TABLE 5.13 FACTORIAL EXPERIMENT ARRANGED IN (i) RANDOMIZED BLOCKS 
AND (ii) A LATIN SQUARE 


8 replieations 8 treatment combinations 
A at 4 levels (1,2,3,4) 
B at 2 levels (1,2) 
1. In randomized blocks Analysis of variance 
Rep. I Replications 7 
12 "Treatment combinations 7 
31 A 3 
B 1 
2 AB 3 
32 Error 49 
2 Total 63 
11 
22 
41 
її. In а latin square Analysis of variance 
42 11 22 12 31 41 32 21 Rowa i 
21 32 31 41 22 12 п 42 STIMMT ast. 7 
12 41 42 11 82 21 22 31 У 3 
31 22 21 42 1 32 41 12 B 1 
82 12 11 31 21 22 42 41 AB 3 
41 42 32 22 12 31 21 11 Ено 2 
11 21 12 32 41 42 31 22 Total 63 
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TABLE 5.14 WEIGHTS OF DENERVATED (y) AND CORRESPONDING NORMAL (г) 


Бер.І 


Length of 


treatment Type of 
(minutes) current 


10) 


20) 


3 (bs) 


5 (bs) 


Rep. II 
1 (61) 


2 (ba) 


3 (bs) 


5 (bs) 


Galvanic 
Faradic 

60 cycle 
25 cycle 


Galvanic 
Faradic 

60 cycle 
25 cycle 


Galvanic 
Faradic 
60 cycle 
25 cycle 


Galvanic 
Faradic 
60 cycle 
25 cycle 


Galvanic 
Faradic 
60 cycle 
25 cycle 


Galvanic 
Faradic 
60 cycle 
25 cycle 


Galvanic 
Faradic 
60 cycle 
25 cycle 


Galvanic 
Faradic 
60 cycle 
25 cycle 


MUSCLE (unit = 0.01 gram) 


Number of treatment periods daily 
One (а) Three (аз) Six (as) 

; ас и. жии асы: —- 
у т у т у т 
72 152 74 131 69 131 
61 130 61 129 65 126 
62 141 65 112 70 111 
85 147 76 125 61 130 
67 136 52 110 62 122 
60 111 55 180 59 122 
64 1% 65 190 64 98 
67 123 72 117 60 92 
57 12 66 132 72 129 
72 165 43 95 43 97 
63 112 66 130 72 180 
56 125 75 130 92 162 
57 121 56 160 78 135 
60 87 63 115 58 118 
61 93 79 1% 68 160 
73 108 86 140 71 120 
46 97 74 131 58 81 
60 126 64 124 52 102 
71 129 64 117 71 108 
53 108 65 108 66 108 
44 83 58 117 54 97 
57 104 55 112 51 100 
62 114 61 100 79 15 
60 105 78 112 82 102 
53 101 50 103 61 115 
56 120 57 110 56 105 
56 101 56 109 71 105 
56 97 58 87 69 107 
46 107 55 108 64 115 
56 109 55 104 57 103 
64 114 66 101 62 99 
59 102 58 98 88 185 
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5.32 Numerical Example: a 4 X 4X 3 Factorial in Randomized 
Blocks. A number of experiments have indicated that electrical stimu- 
lation may be helpful in preventing the wasting away of muscles that are 
denervated. A factorial experiment on rats was conducted by Solandt, 
DeLury, and Hunter (5.8) in order to learn something about the best 
type of current and the most effective method of treatment. The fac- 
tors and their levels are shown below. 


A B с 
Number of treatment Length of treatment Type of current 
periods daily (minutes) 
1 1 Galvanic 
3 2 Faradic 
6 3 60 cycle alternating 
5 25 cycle alternating 


"Treatments were started on the third day after denervation and con- 
tinued for 11 consecutive days. There are 48 different combinations of 
methods of treatment, each of which was applied to a different rat. Two 
replications were conducted, using 96 rats in all. 

The muscles denervated were the gastrocnemius-soleus group on one 
side of the animal, denervation being accomplished by the removal of а 
small part of the sciatic nerve. The measure used for judging the ef- 
fects of the treatments was the weight of the denervated muscle at the 
end of the experiment. Since this depends on the size of the animal, the 
weight of the corresponding muscle on the other side of the body was in- 
cluded as a covariance variate. The data are shown in table 5.14. 

For a covariance analysis we require analyses of variance for both the 
denervated muscle (у) and the normal muscle (х), and the analysis of the 
product (ут). From the discussion in section 5.27 and preceding sec- 
tions, the reader should have little difficulty in completing the analysis. 
The replicate totals, treatment-combination totals, and the grand total 
are first computed. From the treatment totals, the three two-way tables 
are constructed, as given in table 5.15. Since each set of main effect 
totals is obtained twice, this part is self-checking (though only one set of 
main effect totals need be recorded). To illustrate the computations 
made on the A X B table, we have for the analysis of y 


Total s.s. for A X B table 

= $[(610)? + (543)? +-+++ (546)2] — 515 [(6069)?] = 1039 
А = 35 (1936)? + (2028)? + (2105)?] — 3% [(6069):] = 447 
В = 2565)? + (1488)? + (1476)? + (1540)*] — чз [(6069)?] = 223 
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By subtraction, the AB sum of squares is found to be 369. For the sum 
of products, we replace each y? by the corresponding yz product. Thus 
Total s.p. = 41(510) (1030) + (543) (977) ------- (546) (985)] 

— 56069) (11,307) = 1151 
Finally, the ABC sum of squares is found by calculating the total в.5. 


among all 48 treatment combinations, and subtracting the sum of squares 
for A, B, C, AB, AC, and BC. 


TABLE 5.15 Two-way TABLES IN A 4 X 4 X 3 FACTORIAL 


y т 
ay ag а Total ay аз а Total 
B by 510 543 512 | 1,565 1,030 977 897 | 2,904 
Length of bs 481 496 511 | 1,488 902 1,038 848 | 2,788 
treatment b3 469 471 536 | 1,476 941 896 1,000| 2,837 
bs 476 518 546 | 1,540 841 952 985 | 2,778 
с с 442 485 518 | 1,445 917 992 925 | 2,834 
Type of Е 482 453 441 | 1,376 952 969 873 | 2,794 
current 60 | 503 522 557 | 1,582 930 985 976 | 2,891 
25 509 568 589 | 1,666 915 917 956 | 2,788 
A totals 1,936 2,028 2,105 | 6,069 3,714 3,863 3,730 | 11,307 
y ж 
ы b b |ы b b b 
с С | 393 937 359 356 | 723 665 700 746 


Type of Е | 363 337 327 349 | 737 729 692 636 
current 60 | 403 395 384 400/718 743 737 693 
25 | 406 419 406 435 | 726 651 708 703 


The analyses of 92, yz, and 27 are given in table 5.16. The regres- 
sion coefficient of weight of denervated on weight of normal muscle is 
3977/16,013, or 0.248361. The next step is to form the analysis of 
(y — bz)?. This is done by adding to any y? value (—2b) times the cor- 
responding yz value, plus (b?) times the corresponding 22 value. The 
mean squares for (y — bx) are the quantities used to test the significance 
of treatment effects on the weights of denervated muscles, adjusted for 
their regression on the weights of the normal muscles. As shown in 
section 3.86, the treatment m.s. is slightly inflated because of the sam- 
pling error of the regression coefficient. The inflation is sufficiently 
small, however, that only effects which appear to be on the borderline of 
significance need be tested by the exact method (section 3.86), which is 
more laborious to calculate. 
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То the analysis of variance (table 5.16) only two factorial effects are 
significant: those for the main effects of C (type of current used) and for 
the main effects of A (number of treatment periods daily). The AC in- 
teraction m.s. is somewhat higher than the error m.s., but does not ap- 
proach the 5% significance level. These results suggest that in preparing 


TABLE 5.16 ANALYSES OF VARIANCE 


№ yr 2? (y — bx)? 
d.f. 8.8. в.р. 3.8. 8.8. m.s. 

Replications 1 605 2,508 10,854 0 
A (number of treatments) 2 447 64 418 441 220.5 * 
B (length of period) 3 223 137 415 181 60.4 
С (type of current) 3 2,145 104 281 2,111 703.7 ** 
AB 6 369 950 5,202 218 36.3 
АС 6 645 476 1,014 4711 78.5 
BC 9 299 11 2,015 418 46.4 
АВС 18 1,050 666 5,198 1,040 57.8 
Error 47: 3,199 3,977 16013 221 48.1 

Total 952 8,982 8,888 40,910 7,91  ..... 


146 d.f. for (y — bz). 
294 d.f. for (y — бе), 


Nore. In reference (5.8), the sums of squares for AC and АВС are in error. 


summary tables for further examination and for presentation of the 
results, we probably need only the AC two-way table. The averages 
for B (length of an individual treatment) should also be considered in 
case they indicate a trend effect that was not marked enough to attain 
significance. These data appear in table 5.17. 


TABLE 5.17 WEIGHTS OF DENERVATED MUSCLE, ADJUSTED FOR REGRESSION ON 
WEIGHT OF NORMAL MUSCLE (unit = 0.0] gram) 
AC two-way table (s.e. +2.47) 
Number of treatments 


B means а аз ag |Means 
b; 64.4 
be 62.4 G | 56.0 59.1 65.2 | 60.1 
b; 614 — Type of Е | 60.0 55.8 57.3 | 57.7 Т? 
% 64.7 current, 60 | 63.2 63.9 68.6 | 65.2 кеке 
ве. +1.43 25 | 64.5 71.8 73.2 | 69.8 
Means 60.9 62.6 66.1 | 63.2 
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The means have been adjusted in the usual way for the regression 
on the weight of the normal muscle. Because of this adjustment, every 
mean has a slightly different standard error. However, as pointed out 
in section 3.85, it is sufficiently accurate to use an average standard 
error. This is computed from the effective error m.s., which is 48.1 Х 
1.019, or 49.0, as described in section 3.85. Thus the standard error for 
an entry in the two-way table is /49.0/8, or 2.47. 

The 25 cycle alternating current gave a significantly higher mean 
weight than any other type of current. The 60 cycle alternating current 
was superior to both galvanic and faradic currents, the last two not being 
significantly different. The weights increased as the number of treat- 
ment periods daily increased. Оп inspection the increase appears ар- 
proximately linearly related to the number of treatment periods. 

To test this supposition, we may fit a regression of the adjusted A 
totals on the number of treatment periods (the adjusted А means do not 
carry enough decimals to check with the analysis of variance). The 
relevant data are shown below. 


Adjusted 2 = number 
A totals of periods 


о 1950 1 
аз 2005 Б] (a —а)(2 — 2 = 421.667 
а 2115 6 (2 — 3? = 12.667 


By the usual formula, the contribution of the regression to the sum of 
squares for A is (421.667)2/(12.667) (32), or 439. Тһе last divisor, 32, is 
required because the A totals are totals over 32 rats. The sum of squares 
for regression accounts for practically all the sum of squares for A (441), 
so that there is no indication of deviation from linearity, The regression 
coefficient for a single rat is 421.667/(12.667) (32) or 1.0 unit. The con- 
clusion is that, within the range of periods tested, each additional treat- 
ment period per day increased the weight by 1 unit. 

Тһе В means in table 5.17 do not indicate any consistent effect of 
length of an individual treatment period (1 to 5 minutes). Тһе major 
part of the AC interaction seems to come from the anomalous behavior 
of the faradie current, which produced a drop in weight from a; to аз 
and ag. Since the interaction as a whole was not significant, it does not 
seem worth while to examine the statistical significance of this effect. 


5.33 Other Designs. The total number of treatment combinations in- 
creases rapidly as the number of factors or the number of levels of a fac- 
tor is increased. In this event the latin square necessitates an amount 
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of replication that is usually impracticable, while it becomes difficult to 
assemble homogeneous replications for a randomized blocks arrange- 
ment. Consequently, the experimental error per unit tends to increase. 
In order to avoid this increase in error, a number of designs have been 
produced. The basic principle, which is common to all these designs, is 
the use of a “block” that is smaller than a complete replication. That is, 
the arrangements are such that the differences among these “incom- 
plete” blocks are eliminated from error, just as the differences among 
replicates are eliminated from error in a randomized blocks design. Un- 
fortunately, as will be explained later, this reduction in block size can be 
accomplished only by the deliberate sacrifice of accuracy on certain 
treatment comparisons. The designs fall into three main groups ac- 
cording to the particular treatment comparisons that are sacrificed in 
this way. 

In the first growp, the treatment comparisons that are sacrificed are 
the high-order interactions. These designs are appropriate in lines of re- 
search where experience has shown that high-order interactions are 
nearly always negligible. The designs available in randomized incom- 
plete blocks are described in chapter 6 and those which can be placed in 
latin squares in chapter 8. 

In the second group, known in agriculture as split-plot experiments, a 
factor, or a group of factors and their interactions, are sacrificed. These 
designs, which are very widely used, are discussed in chapter 7. Finally, 
in certain cases it is possible to construct designs so that the loss in ac- 
curacy is spread evenly over all factors and their interactions. Within 
this group there are several types: balanced lattices (chapter 10), bal- 
anced incomplete blocks (chapter 11) and balanced lattice squares 
(chapter 12). In cases where the effects of the factors and the sizes of the 
interactions are rather unpredictable, these designs may be suitable. 
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СНАРТЕВ 6 


Confounding 


6.1 The Principle of Confounding 


6.11 The 2° Factorial Experiment with Complete Confounding. This 
chapter deals with designs where the size of block is reduced by the 
sacrifice of accuracy on certain high-order interactions. The method 
will be illustrated first for a 2° experiment with three factors, A, B, and 
C, each at two levels. Since there are only 8 treatment combinations, 
the replication is not particularly large and in practice this experiment 
would most frequently be arranged in ordinary randomized blocks or 
perhaps in an 8 Х 8 latin square. The example is chosen because of its 
simplicity. Г 

The interaction of highest order is the ABC interaction. It will be 
recalled (section 5.23) that this interaction is estimated from the com- 
parison 


(abe) + (a) + @) + (с) - (ab) - (ас) — (іс) - (1) 


Suppose that each replicate in the experiment is divided into 2 blocks of 
4 units each, such that one block contains abc, а, b, and с, while the other 
contains ab, ac, be, and (1). With 3 replicates the plan (before ran- 
domization) would be as follows. 


TABLE 6.1 23 EXPERIMENT IN BLOCKS OF 4 UNITS, WITH АВС CONFOUNDED 


Rep. I Rep. II Rep. III 
Block 1 2 3 4 5 6 
abe ab abc ab abc ab 
a ac a ac a ac 
b bc b bc b bc 
а) с @) с (1) 


There are two important properties of this plan. The total of blocks 

1, 3, and 5, minus the total of blocks 2, 4, and 6, is the ABC interaction 

total. Thus the ABC interaction is one of the components of the com- 
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parisons amongst blocks. It is said to be completely confownded with 
blocks. Secondly, the other six factorial effects, A, B, C, AB, AC, and 
BC, will all be found to be orthogonal with the block totals. For in- 
stance, the AB interaction may be written (apart from the divisor) 


[(abe) + (с) - (a) — ()] + [(ab) + (1) - (ac) — (be)] 


Of the 4 units in any block, two carry a (+) sign in this expression and 
two carry a (—) sign. Consequently, if we increase all the observations 
in a selected block by any amount, say 50, the estimate of AB remains 
unchanged, and similarly for the other 5 factorial effects. This means 
that these 6 factorial effects are not influenced by differences amongst 
blocks. Various phrases are used to describe this property: the effects 
may be said to be unconfounded with blocks, or free from block effects, 
or to be composed entirely of within-block comparisons. 

Thus differences amongst blocks of 4 units are eliminated from the ex- 
perimental errors of the main effects and two-factor interactions, whereas 
with randomized blocks only differences amongst blocks of 8 units are 
eliminated. The reduction in effective block size is attained by making 
ABC the same as one of the block comparisons. There is no within- 
block information available about АВС. 

The degrees of freedom in the analysis of variance separate as follows. 


d.f. 


Blocks 5 
А, B, C, AB, AC, BC 6 
Error 12 

Total 23 


АП sums of squares are calculated in the usual way, so that there is no 
complexity in the computations. 

The composition of the error s.s. is perhaps worth noting. Consider 
the 3 blocks (1, 3, and 5 in table 6.1) that contain the treatments абс, а, 
b, and c. "These may be regarded as a randomized blocks experiment 
with 4 treatments and 3 blocks. Consequently the interaction of treat- 
ments with blocks contains six components. Similarly in blocks 2, 4, 
and 6 the interaction of the other set of treatments with blocks provides 
six components. "These two sets of six components constitute the error 
for the complete experiment. In other words, the error term in а con- 
founded factorial is made up of interactions between treatments and in- 
complete blocks. This remains true in the more complex designs that 
appear later in this chapter, even though in these cases the composition 
of the error is less easy to detect. 
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ABC does not appear explicitly in the analysis above. Actually, the 
experiment, if properly randomized, provides an estimate of error and a 
test of significance for ABC. In practice it is seldom worth while making 
this test, which is usually very insensitive. The test may, however, 
throw additional light on the nature of confounding. Let us ignore all 
other factorial effects and consider table 6.1 as the plan of an experiment 
to determine ABC, i.e., to determine the difference between the group 
of treatments abc, а, b, c and the group ab, ac, be, (1). From this point of 
view the block becomes the “experimental unit" and the experiment is 
of the ordinary randomized blocks type, having 2 “treatments” and 3 
replicates. The total s.s. among the 6 “units” is, of course, the blocks s.s. 
in the previous analysis. Thus the blocks s.s. may be subdivided into: 


d.f. 
Replicates 2 
ABC 1 
Error for ABC 2 
"Total 5 


If confounding has been effective, the crror for ABC, being composed of 
comparisons among blocks, will be larger than the error (with 12 а.) 
which applies to the rest of the experiment. Moreover, the error for 
ABC is estimated from only 2 d.f., so that the test of ABC is a poor one. 
The test might be worth making with a considerable number of experi- 
ments of the same type, where it is desired to examine the average ABC 
effect over the whole group. 


6.12 The 2° Factorial Experiment with Partial Confounding. Any of 
the seven factorial effects may be confounded with blocks in this way. 
The rest will then be free from block effects. These facts enable us to 
spread the confounding in an experiment among several factorial effects. 
A plan of this type is shown in table 6.2. 


TABLE 6.2 2 EXPERIMENT IN BLOCKS OF 4 UNITS, wrrH ABC, AC, BC 
PARTIALLY CONFOUNDED 


Rep. I Rep. II Rep. III 

Block 1 2 3 4 5 6 
abc ab abc ab abc ab 

a ac ac bc bc ac 

b bc b a a b 

c a) (1) c (1) с 


АВС Ас BC 
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The difference between the totals of blocks 1 and 2 represents the ABC 
interaction. In replication II, however, the block composition has been 
changed so that the difference between the block totals is the AC inter- 
action. In replication ІП the difference is the BC interaction. 

With this plan, A, B, C, and AB are entirely free from block effects. 
ABC is completely confounded with blocks in replicate I, but in the other 
two replicates the estimate of ABC is orthogonal with blocks. Thus a 
**within-block" estimate of ABC may be obtained from replicates II and 
ІП. Similarly, a “within-block” estimate of AC is available from rep- 
licates I and III, and one of BC from I and II. These three effects are 
partially confounded with blocks, since an estimate that is free from 
block effects can still be made for each effect. Each ''within-block" 
estimate is derived from 2 out of the 3 replicates in the experiment. The 
ratio 24 serves as а measure of the extent of the confounding. Yates 
(6.1) calls this ratio the relative information on the confounded inter- 
action. The ratio gives the amount of information available on the 
partially confounded effect, relative to that available on an uncon- 
founded effect. 

In the analysis of variance, all factorial effects appear. 


d.f. 

Blocks 5 
А 1 
В 1 
с 1 
АВ 1 
АС 17 
BC 1 
АВС T 
Error 11 
Total 23 


The sums of squares for blocks and for the unconfounded effects А, 
B, C, and AB are found in the usual way. The sum of squares for AC is 
calculated only from replicates I and III, the divisor for [АС]? being 16 
instead of 24. Corresponding rules apply to BC and ABC. In the analy- 
sis of variance, the primes are inserted as a reminder that these effects 
are partially confounded and require special calculations. 


6.13 Confounding in the 2” Series. In order to use confounded de- 
signs intelligently, it is not necessary to understand in detail the methods 
by which they are constructed. The rules for the construction of con- 
founded 2” factorials will be briefly described, however. since they are 
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fairly simple and may help to explain the choice of plans presented at 
the end of this chapter. 

If a replicate is to be divided into 2 blocks, any factorial effect can be 
confounded with blocks. In most cases we would confound the highest- 
order interaction in all replicates. Suppose that each incomplete block 
is to be further divided into 2, so that there will be 4 blocks (Т... IV) 
in a replicate. Having chosen one factorial effect to represent the com- 
parison (I + II — Ш — IV), we may easily verify that any other fac- 
torial effect may be made to represent the comparison (I — II + III — 
IV). That is, to arrange a 2" factorial in blocks of 2"? units, we may 
confound any two factorial effects that we choose. 

The remaining block comparison (I — II — III + IV) represents a 
third factorial effect, which is also confounded with blocks. Barnard 
(6.2) has shown that this effect is uniquely determined by the two ef- 
fects that were chosen. It is always their "generalized interaction," and 
is found by combining all the letters that appear in the two chosen fac- 
torial effects, and cancelling all letters that enter twice. Thus, if a 25 
factorial is to be confounded in blocks of 2°, or 8, units, we might be in- 
clined to choose ABCDE and BCDE as the first two factorial effects to 
be confounded. The generalized interaction of these two effects is 
ABBCCDDEE or, after cancellation, is А. Consequently, if these two 
effects are chosen for confounding, the main effect of A is also automati- 
cally confounded. These results may be summarized as follows. “If a 
replicate of а 2" factorial is arranged in blocks of 27-2 units, three fac- 
torial effects are confounded with blocks. Of these, two may be chosen 
at will: the third is their generalized interaction.” 

To proceed, a plan with 8 blocks in the replicate can be obtained from 
one with 4 blocks in the replicate by dividing each block into two parts. 
In making this division we may confound any factorial effect, except one 
of the three that is already confounded. Thus, in the 25 example, where 
ABCDE, BCDE, and A were confounded in the division into 4 blocks, 
we might choose say ACE for the division into 8 blocks. But with 8 
blocks in the replicate, there are seven components of the comparisons 
among blocks, and each of these must represent one factorial effect. 
Непсе, in addition to the four factorial effects that are known to be соп- 
founded, three others are confounded. These three are the generalized 
interactions of ACE with ABCDE, BCDE and A, or BD, ABD, and CE, 
respectively. 

The general rule should be clear from this case. Suppose that a rep- 
licate in a 2" factorial is to be divided into blocks of 2"^* units, so that 
there will be 2* blocks in the replicate. Then we may select any k fac- 
torial effects to be confounded, subject only to the restriction that none 
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of these must be a generalized interaction of any group of the others. A 
further (2* — k — 1) effects are automatically confounded. These аге 
all the effects which can be expressed as generalized interactions of the 
group of effects selected for confounding. 

In the construction of designs for practical use, it follows that the ef- 
fects to be confounded must not be selected without examination of the 
other effects that are automatically confounded. If confounding is to be 
restricted to high-order interactions, we cannot choose ABCDE and 
BCDE in a 25 factorial, since A would then be confounded. In order to 
avoid confounding any main effects or first-order interactions in this case, 
а little trial will show that the only possible choice is two second-order 
interactions and one third-order interaction: for instance ABC, ADE, 
and their generalized interaction BCDE. The plans at the end of this 
chapter contain what appear to be the best choices for general use. 

When the interactions to be confounded have been selected, there re- 
mains the problem of writing out the plan showing the treatment com- 
binations that appear in each block. This may be done by the procedure 
used in the present discussion. The replicate is first divided into two by 
confounding one effect, then into four by confounding a second, and so 
on. The rules which express each factorial effect in terms of the original 
treatment combinations (section 5.24) decide the composition of the 
blocks at each stage. For experiments with a large number of factors, 
arranged in blocks of small size, Fisher (6.3) has given an alternative 
method that is more expeditious [see also Finney (6.4)]. In this method 
a group of 2” letter combinations is constructed, such that the gen- 
eralized interaction of this group with any treatment combination gives 
the members of the block that contains this treatment combination. 

Fisher (6.3) has examined the general problem of confounding when 
the object is to keep all main effects and first-order interactions com- 
pletely clear of block effects. He has shown that experiments up to the 
27 factorial (128 treatment combinations) may be placed in blocks of 8 
units, while blocks of 16 units can accommodate experiments up to the 
215 factorial (32,768 treatment combinations). 

The preceding discussion applied to a single replication. With more 
than one replicate we may repeat the first replicate, in which case the 
confounding is complete for the interactions involved. Alternatively, 
the effects that are confounded may be changed from one replicate to 
the next, so that some intra-block information is available on all effects. 


6.14 Example of a 2* Factorial Confeunded in Blocks of 8 Units. 
Table 6.3 contains the plan and yields of a 2* field experiment on beans 
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TABLE 6.3 PLAN AND YIELDS (BEANS IN POUNDS) OF A 2* FACTORIAL 
EXPERIMENT 


Rep. II 


Block a 
351 


Block b 
395 


conducted by the Rothamsted Experimental Station in 1936. The fac- 
tors were 


Dung (D): none, 10 tons per acre. 

Nitrochalk (М): none, 0.4 cwt. М per acre. 
Superphosphate (P): none, 0.6 cwt. Р.О per acre. 
Muriate of potash (К): none, 1.0 cwt. К2О per acre. 


With 16 treatment combinations in blocks of 8 plots, only one factorial 
effect is confounded in each replication. From the plan it will be seen 
that DNPK, the highest-order interaction, was confounded in each of 
the two replicates. The computations proceed as follows. 

Step 1. Calculate the totals for each treatment combination, the 
block and replicate totals, and the grand total. The total s.s. and the 
sums of squares for replicates and. for blocks within replicates should 
present no difficulty. 

Step 2. This deals with the computation of the factorial effect totals. 
Write down each factorial effect total as a linear function of the treat- 
ment combination totals, as shown in table 6.4. The treatment com- 
binations and their total yields are placed in a systematic-order in the 
first two columns, and the remaining columns give the factorial effects. 
Although the writing of the table requires some time, the procedure is 
quite simple. When the columns for D and N have been written down, 
the column for DN is obtained as the product of corresponding signs in 
the D and N columns. Similarly, later in the table DNP may be ob- 
tained as the product of the signs of D and NP, or of N and DP, and so 
on. Thus it is necessary to know only the expressions for the main ef- 
fects. 

The factorial effect totals are shown at the bottom of each column. 
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The total for DNPK is not required, since owing to the confounding 
DNPK will not appear explicitly in the analysis of variance. The ДИРК 
column, however, allows a check to be made, as follows. If the 15 totals 
are added, it may be verified by inspection of table 6.4 that their sum, 
+66, should equal 16 times the (dnpk) treatment total, minus the grand 
total, 1.е., 16(98) — 1502 = +66. Alternatively, Yates’ automatic 
method of obtaining the factorial effect totals (section 5.24a) can be 
used in this step. 

Step 8. The contribution of each factorial effect to the treatments 
s.s. in the analysis of variance is now obtained. Since there are 32 plots, 
the square of each effect is divided by 32. The analysis of variance ap- 
pears in table 6.5. Note that DNPK is omitted, since it already appears 


TABLE 6.5 ANALYSIS OF VARIANCE 


d.f. 8.8. m.s. 
Replications 1 3.1 
Blocks in reps. 2 123.2 61.6 
D 1 2.0 
N 1 325.1 ** 
Р 1 6.1 
K 1 4.5 
DN 1 32.0 
DP 1 242.0 ** 
NP 1 78.1 
DK 1 6.1 
NK 1 32.0 
PK 1 24.5 
DNP 1 2.0 
DNK 1 10.1 
DPK 1 15.1 
NPK 1 32.0 
Error 14 340.0 24.29 
Total 31 1277.9 


in the blocks s.s. Аз a check, the total of the 14 treatment s.s., 811.6, 
plus that for РУРК, 78.1, should equal the treatments s.s. as found in 
the ordinary way from the totals for the 16 treatment combinations. 
"The error s.s. is found by subtraction. 

Only two effects are significant: the main effect of nitrochalk, which 
produced a depression of the yield, and the DP interaction. Since 
neither D nor P produced significant main effects, it is advisable not to 
lay much stress on the DP interaction in the absence of other confirm- 
atory evidence. 
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Step 4. The best presentation of the results will depend on their 
nature and on the audience for whom they are intended. Table 6.6 gives 


TABLE 6.6 DIFFERENTIAL RESPONSES (POUNDS PER PLOT) 


Response with 
Mean Я 
Factor Жарооз Dung (D) Nitrochalk (№) | Super. (P) Mur. pot. (K) 
Abs. | Pres. | Abs. | Pres, | Abs. | Pres, | Abs. | Pres. 
Dung (D) SEO РР з», xe —2.5 | +1.5 | -6.0 | +5.0 | +0.4 | —1.4 
Nitrochalk (№) 204. | =8.4] —4.4]|.............. —9.5 | 3.2 | —4.4 | —8.4 
Super. (Р) «49.9 | —4.6| +6.4 | —2.2 | $4.0 |.......|....... -0.9 | +2.6 
Mur. pot. (К) —0.8 | *0.1| —1.6 | +1.2 | -2.8 | —2.5 | +1.0 |.......|....... 


в.е.: +2.46 for differential response; +1.74 for mean responses. 


a concise presentation of the main effects and two-factor interactions in a 
form that has been used frequently at Rothamsted Experimental Sta- 
tion. The response to each factor is shown separately for each level of 
every other factor. Thus the row labelled *Dung" contains the mean 
response and the differential responses to dung. The figure —2.5 (re- 
sponse to dung with nitrochalk absent) is the average response to dung 
over all plots that did not receive nitrochalk. The table enables a quick 
appraisal to be made of the nature of the main effects and two-factor 
interactions. In the present case, of course, the table is scarcely needed 
because of the dearth of effects. 

The data in the table are easily calculated from the factorial effect 
totals in table 6.4. For instance, 


Mean response to dung = 1%] = — 8; = -0.5 
Mean response to dung 

(nitrochalk absent) = 7g {[D] — [DN]] = -4% = —2.5 
Mean response to dung 

(nitrochalk present) = 355([D] + [DN]] = +34 = 41.5 


These relations may be verified from the signs in the columns in table 
6.4. 

The standard error per plot is 4/24.29, or 4.93. Since each differential 
response in table 6.6 is the difference between the means of two groups 
of 8 plots, its standard error is (4.93)/2, or 2.46. The standard error for 


1 
а mean response is — times this value, or 1.74. 


4/2 
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6.15 Confounding of a 3? Factorial. Let the symbol ij denote the 
treatment combination that has the ith level of A and the jth level of 
В (i,j = 0, 1, 2). As before, we wish to keep the main effects clear of 
block effects, that is, to confound only the AB interaction. Since there 
аге 9 treatment combinations in a replicate and since 3 is the only factor 
of 9, the size of the incomplete block must be 3 units. 

We have seen (section 5.26) that the main effects of A and B both con- 
tain two components, while the AB interaction has four components. 
Further, the main effects and the interaction can be divided into single 
components in various ways, and the division that is most appropriate 
for interpretation of the results will change from experiment to experi- 
ment. This suggests that the particular components of the interaction 
which we should desire to confound will vary from case to case. Actu- 
ally, the possibilities for confounding are much more restricted with the 
3” than with the 2” system: in fact, only one set of components of AB 
lends itself readily to confounding. 

The confounding is based on the properties of the 3 X 3 graeco-latin 
square. Suppose that the 9 treatment combinations are set out as in 
table 6.7, on which a 3 X 3 square is superimposed. Following the nota- 


TABLE 6.7 Use оғ А 3 X 3 GRAECO-LATIN SQUARE TO OBTAIN THE АВ 
INTERACTION 


bo b b 


в | (007: (01)7372 (02) 157 
a | (10152 (11) 773 (12)7,711 
a» | (20153 (211571 (22)hJa 


tion used by Yates (6.1), we denote the latin letters in the square by 
Т,,* I», and Is, respectively, and the greek letters by Ji, Jo, and Уз, 
respectively. 

It is clear that comparisons among the row totals of the square give 
the two components of the main effect A, while the comparisons among 
the column totals give the main effect of B. Now consider the J totals: 


I, = (00) + (11) + 02; 1:- (10) + (21) + (02); 
I, = (20) + (01) + (12) 


By the basic property of the latin square, comparisons among these 
totals are orthogonal to both rows and columns: that is, to the main 
effects of A and B. Hence, the comparisons among the J totals must 


* In this edition the J and J values agree with those used by Yates. 
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represent two of the four components of the AB interaction. The same 
argument shows that the two remaining components of AB are derived 
from the comparisons among the J totals, where 


Л = (00) + (21) + (12; Ja = (10) + (01) + (22); 
Js = (20) + (11) + (02) 


A numerical verification of this fact was made at the end of section 5.26, 
where the sum of squares for AB was calculated from the latin and greek 
letter totals which correspond to the J and J totals. 

The application of this result to confounding is shown in table 6.8. 


TABLE 6.8 3 X 3 EXPERIMENT WITH AB PARTIALLY CONFOUNDED 
Plan (i) Plan (ii) 
Incomplete blocks Incomplete blocks 


I components confounded J components confounded 


There are two possibilities. In plan (1) the Г components of AB are 
completely confounded with incomplete blocks, since the block totals 
have been made the same as the J totals. The main effects and the J 
components of AB are unconfounded. In plan (ii), the J components 
of AB are completely confounded. It should be noted that neither the 
I nor the J components are easy to interpret in terms of the results of 
an actual experiment. They are selected for confounding because they 
are convenient for this purpose. 

If the number of replicates in an experiment is even, we may use plan 
(i) in half the replicates and plan (ii) in the other half. With this ar- 
rangement the 7 components may be estimated (clear of block effects) 
from those replicates in which plan (ii) is used, and vice versa for the J 
components. The relative accuracy is 14 for each set of components. 
It may be shown that the relative accuracy is also 1 for any component 
of AB in which we may be interested. The confounding is said to be 
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balanced with respect to AB. Balanced confounding is preferable, for 
there is no reason to confound J more heavily than J. 

This design has limited practical utility. It may be used where the 
factors are known to operate independently, so that AB will be neg- 
ligible. If information about AB is wanted, a randomized blocks design 
is advisable unless there will be a substantial reduction in error variance 
from the use of incomplete blocks. The same principle of construction, 
however, leads to a very useful design for the 3° factorial. 


6.16 Confounding of a 3° Factorial. The same notation will be used. 
Thus, (021) denotes the treatment combination agboc;. Since there are 
27 treatment combinations, the possible sizes of incomplete block are 
3 and 9 units. The plan for blocks of З units necessitates confounding of 
the two-factor interactions and is not given here. With 9 units per 
block, only ABC need be confounded. In the case of the 3? factorial we 
were able to divide the 9 treatment combinations into groups of three 
(the 7 and J groups), such that comparisons among the group totals 
gave the components of AB. We shall show that the 27 treatment com- 
binations in a 3° factorial can be divided into groups of nine such that 
comparisohs among the group totals give the components of ABC. Each 
set of three groups will contribute two components of ABC. Since ABC 
has in all eight components or degrees of freedom, there will be four such 
sets. 

The AB interaction can be calculated separately for each level of C. 
As before, we will obtain the AB interaction from its Г and J compo- 
nents. Table 6.9 shows the J components for each level of C. 


TABLE 6.9 Tue 1 COMPONENTS ог AB SHOWN FOR EACH LEVEL OF C 


Co Сі С 


1 | (000) + (110) + (220) | (001) + (111) + (221) | (002) + (112) + (222) 
I» | (100) + (210) + (020) | (101) + (211) + (021) | (102) + (212) + (022) 
1 | (200) + (010) + (120) | (201) + (011) + (121) | (202) + (012) + (122) 


We may regard table 6.9 as a 3 X 3 table, in which each entry is the 
total of three treatment combinations. The row totals of the table give 
the J components of AB, while the column totals give the main effect of 
C. Further, we may take J and J totals from this table just as in table 
6.7. These totals will be called Z — I, I — Jı, ete., since they come 
from the J components of AB. Thus 
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I — 1: (000) + (110) + (220) + (101) + (211) + (021) + (202) + (012) + (122) 
1- 1»: (100) + (210) + (020) + (201) + (011) + (121) + (002) + (112) + (222) 
I — Is: (200) + (010) + (120) + (001) + (111) + (221) + (102) + (212) + (022) 


By the same argument as for the 3 X 3 design, the comparisons among 
these totals of 9 treatment combinations must represent two of the com- 
ponents of the interaction of AB with C; that is, of the ABC interaction. 

Consequently if we put all treatment combinations in J — J, into the 
first block, those in J — 1; into the second, and those in J — 1 3 into the 
third, we have a plan which completely confounds two of the eight com- 
ponents of ABC, and leaves all other factorial effects unconfounded. 
This arrangement appears as replication I in plan 6.7 for the 3? fac- 
torial at the end of this chapter. 

"The second set of three groups of 9 treatment combinations is ob- 
tained by taking the J totals from table 6.9. 


I — Jy: (000) + (110) + (220) + (201) + (011) + (121) + (102) + (212) + (022) 
I — Ja: (100) + (210) + (020) + (001) + (111) + (221) + (202) + (012) + (122) 
I — Л: (200) + (010) + (120) + (101) + (211) + (021) + (002) + (112) + (222) 


This grouping constitutes replication II in plan 6.7. The remaining sets 
are obtained by forming a 3 X 3 table similar to table 6.9 for the J com- 
ponents of AB at each level of С. The reader may verify that the 
J — L, J — Iz, and J — Із groups are as shown in replicate III, and 
that the J — Jı, J — Js, and J — Jz groups are as shown in replicate 
IV of plan 6.7. 


6.17 Example of a 33 Factorial Confounded in Blocks of 9 Units. 
This experiment, conducted by the Seed Laboratory, Iowa State Col- 
lege, tests the effects of 3 levels of nitrogen, 3 of phosphorus, and 3 of 
potash on the germination of lettuce seedlings. The seed was thoroughly 
mixed, and divided into 108 samples of about 60 seeds each. Each 
sample was planted in a copper box, 6 in. square and 114 in. deep, іп a 
mixture of soil and sand. The boxes were placed in a germinator, at à 
temperature of about 32°C. At the end of 5 to 7 days the seedlings 
were classified as normal, abnormal, hard, or dead. The data in table 
6.10 show the numbers of normal lettuce plants. 

There were 4 replications, each placed on a different shelf in the ger- 
minator. On a shelf the boxes were placed in 3 columns of 9 boxes each, 
each column being an incomplete block. It will be noted that all 3 fer- 
tilizers had a deleterious effect on emergence. 
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The computations proceed as follows. 


Step 1. Form the block and replicate totals and the grand total 
(shown in table 6.10) and also the totals for each treatment combination 
and the three two-way tables (shown in table 6.11). 


Step 2. These data enable us to calculate the total s.s. and the sum 
of squares for replications, blocks within replications, and for the М, 
P, K, NP, NK, and PK factorial effects. All are obtained in the usual 
way and are entered in the preliminary analysis of variance (table 6.12). 


TABLE 6.12 ANALYSIS OF VARIANCE FoR А 8 X 8 Х 3 FACTORIAL 


d.f. 8.8. m.s. 

Replications 3 2,041.88 
Blocks within replications 8 5,008.15 626.02 

N 2 1,016.67 508.34 ** 

P 2 917.39 458.70 ** 

K 2 293.39 146.70 

NP 4 399.27 99.82 

NK 4 589.61 147.40 

PK 4 212.89 53.22 
МРК: confounded in replications 

1 2 25.21 12.60 

2 2 64.22 32.11 

3 2 6.39 3.20 

4 2 198.30 99.15 
Error 70 4,146.88 59.24 

Total 107 14,920.25 


Subdivision of part of the treatments s.s. 


N: L 1 1,012.50 1,012.50 ** 
Q 1 4.17 4.17 
P UE 1 917.35 917.35 ** 
9 1 0.04 0.04 
К 2 1 284.01 284.01 * 
Q 1 9.37 9.37 
NP: LXL 1 184.08 184.08 
Rest 3 215.19 71.73 
NK: LXL 1 256.69 256.69 * 
Rest 3 332.92 110.97 
NPK:LXLXL кыча» 59.12 59.12 


Notice that we do not calculate the total s.s. for treatments (with 26 
d.f.), since this contains part of the blocks s.s. 
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Step 3. There remains the calculation of the contribution from the 
МРК interactions. Before doing this, it may be remarked that some- 
times, from the nature of the factors or from previous experience, there 
is good reason to believe that the three-factor interactions will be neg- 
ligible. In this case the experimenter may decide to pool the sum of 
squares for NPK with the error s.s. without troubling to compute the 
sum of squares for NPK. The pooled error would have 78 d.f. and would 
be obtained by subtraction of the sum of squares calculated in step 2 
from the total s.s. The danger from this procedure is that if three- 
factor interactions are present the experimenter may not detect them, 
and moreover his estimate of error will be inflated. For many types 
of field experimentation in agriculture the procedure seems reasonably 
safe, When there is doubt it is better to isolate NPK. 

Consider the two components of NPK that are confounded in rep- 
lication 1. The contribution of these components to the sum of squares 
for NPK must be calculated from the remaining 3 replicates, in which 
they are unconfounded with blocks. The totals needed are shown in 
table 6.11 under the heading “МРК components.” From the treat- 
ment totals, compute the total (944) of the 9 treatment combinations 
that appear in block 14. Thus 


Block 1A = 012 + 122 + 220 + 202 + 101 + 021 + 000 + 110 + 211 
944 = 119+ 81+ 92+ 96 + 82+ 99+171+118+ 86 


In the same way we obtain 1102 for the total over all treatments that 
appear in block 1B and 1131 for block 10. Underneath these figures 
we place the respective totals for blocks 14, 1B, апа 1С. By subtraction 
we obtain the totals 773, 748, and 737. These are the totals of the groups 
of treatment combinations taken over replications 2, 3, and 4. The sum 
of squares of deviations of these quantities from their mean is divided 
by 27, since each figure contains 27 observations. The result, 25.21, is 
the contribution of the two components of МРК to the sum of squares 
for МРК. The remaining six components are found similarly from rep- 
lications 2, 3, and 4. Of course, all eight components could be computed 
in one step as 


3 ((773)* + (748)?-+ + «++ (701)?-+ (071??] — 54 ((2258)? + - - - + (2274)"] 
The error s.s., with 70 d.f., is calculated by subtraction. 


Step 4. In an experiment of this type, where each factor has equally 
spaced levels of an ingredient, it is usually advisable to examine the linear 
and quadratic components of the response curves. The contributions 
to the treatments s.s. are displayed in the lower section of table 6.12. 
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The method of calculation was described in section 5.26, where the data 
for N and P in this experiment were used as an example. All three fer- 
tilizers show significant linear responses, with no indication of any de- 
parture from linearity. 

With regard to the two-factor interactions, the “linear by linear" com- 
ponents have been isolated for NP and NK (see section 5.26 for the 
method). For PK it was not thought worth while to do this, since the 
total в.з. for PK, 212.89, is not large enough to allow any single compo- 
nent to be significant. The “linear by linear” component is significant 
for NK but not for МР. 

Where the “linear by linear” components of the two-factor interactions 
are large, it is desirable to isolate the “linear by linear by linear” compo- 
nent of the three-factor interaction. This will be done as an exercise. 
First we must define this component. With two factors, say N and P, 
the МІРІ, component has been defined as 

(22) -- (00) — (20) — (02) 
where the numbers refer to the levels of М and P. The value of this 
quantity at the highest level of К, minus its value at the lowest level of 
К, measures the linear effect of К on NzPz, and is the NP; Kr; compo- 
nent in question. Algebraically, it is 
L = (222) + (200) + (020) + (002) — (022) — (202) — (220) — (000) 
The estimate from the treatment totals will be found to be —27. 

Since three-factor interactions are partially confounded with blocks, 
this component is also partially confounded and must be adjusted so as 
to remove block effects, The nature of the confounding is вееп Әу writing 
down the expression above in the original table of plot yields (table 6.10), 
‘This has been done by means of the + and — signs that appear to the 
left of the seedling numbers іп the table. It is evident that Z is orthog- 
onal to the totals of blocks 1C, 2B, 3C, and 4С, since each of these 
blocks has опе + and one —. In blocks 1B, 20, ЗВ, and 4B, L has three 
+ signs, while in the remaining blocks it has three — signs. Hence, à 
quantity that is free from block effects is 

3L — (1B) — (20) - (3B) — (4B) + (14) + (24) + (BA) + (44) 
or 
3(—27) — 354 — 134 — 232 — 290 + 171 + 308 + 197 + 302 = —113 
the block symbols denoting block totals. This expression, apart from а 
divisor, may be shown to be the least squares estimate of Nr P; Kr, ad- 
justed for block differences. This estimate requires the assumption that 
all other components of the МРК interaction are negligible. 

И this quantity is expressed algebraically as a linear function of the 
plot yields, the sum of squares of the coefficients is found to be 210. Тһе 
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contribution of N Pr Kr, to the treatments s.s. is therefore (113)2/216, 
or 59.12, which is practically the same as the error m.s. If the number 
of replicates differs from four, the procedure for isolating the "linear by 
linear by linear” component remains the same, except that the divisor for 
the square becomes 54r instead of 216. 


Step 5. 'This concerns the presentation of the results. Usually it 
will be sufficient to show the three two-way tables of means, which are 
derivable from the two-way tables of totals (in table 6.11) on division 
by 12. Since all main effects and two-factor interactions are uncon- 
founded, standard errors and t-tests for the 3 X 3 tables are obtained 
just as in a randomized blocks experiment. The principal results are 
that each fertilizer has produced a significant decrease in the numbers 
of seedlings that emerged, the decrease being substantially proportional 
to the amount of dressing. The significant VK, interaction represents 
the fact that the decrease in emergence from no to ng was smaller at the 
Кә level than at the ko level. There is some indication of a similar effect 
with N and P, though this did not attain significance. 

The individual 27 treatment totals or means cannot be used as they 
stand for interpretative purposes, since they contain some block effects. 
A table of these totals or means will probably be unnecessary unless some 
aspect of the three-factor interaction requires study. To obtain such a 
table, we adjust each total so as to remove block effects. Each block 
effect is first estimated. For this purpose we do not use simply the ob- 
Served block mean, since that in turn contains treatment effects. The 
least squares estimate of any block effect is 


ФИ “(Бос total) — (total of treatments appearing in the block)] 


The basic data needed are available in the section headed “МРК com- 
ponents" in table 6.11. Thus, for block 14, the estimated effect is 


54171) — (944)] = —9.6 
The block effects are given below. 


Block 
e c — 
Replication А В с 
1 —9.6 +11.6 +16.5 
2 +4.2 —4.0 —15.1 
3 -8.8 —5.4 —5.4 
4 +3.9 46.3 +6.0 


To adjust any treatment total, we note from the plan the 4 blocks in 
which it appears, and compute the sum of the effects for these 4 blocks. 
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This quantity is subtracted from the unadjusted total to give the ad- 
justed total. Thus for nıpıko, which appears in blocks LA, 24, ЗВ, and 
4B, the adjusted total is 


118 — [—9.6 + 4.2 — 5.4 + 6.3] =: 122.5 
To obtain the adjusted mean we divide by 4 as usual. 


6.18 Міхей Series: the 3 X 3 2 Factorial. In the designs discussed 
in previous sections, all factors have the same number of levels. When 
the number of levels differs from one factor to another, the utility of 
confounding is limited. Usually some two-factor interactions must be 
confounded, and the computations become more laborious. Only five 
designs of this type are given at the end of this chapter, though a number 
of others are available in the literature. As an example, we will consider 
the confounding of a 3 X 3 X 2 factorial, which has 18 treatment com- 
binations. 

The main effects of a factor will be kept clear of block effects if every 
block contains each level of the factor the same number of times. Thus 
the factor A, which occurs at 3 levels (0, 1, 2), is unconfounded if every 
block contains an equal number of 0's, 1’s, and 2’s. Consequently, block 
size has-to be a multiple of 3. Similarly we can keep the main effects of 
В (0, 1, 2) unconfounded if the block size is a multiple of 3. For the 
factor C at 2 levels (0, 1), the block size must be a multiple of 2. Hence, 
in order to keep all main effects clear of blocks, the only feasible block 
size is 6. 

Further, with 6 units in a block, every possible combination of the 
levels of A and those of C can appear once in every block, so that we may 
expect to be able to keep AC, and likewise BC, unconfounded. We can- 
not place all the 9 combinations for AB in a block, so that AB will be 
partially confounded. 

From this approach the plan can be constructed rather easily. Every 
block is to contain the 6 combinations (00), (10), (20), (01), (11), (21) 
of Band C. The only question is the manner in which the 3 levels of A 
are combined with the 6 pairs above. This allocation is to be such that 
all 6 combinations of AC appear in the block. That is, the 0, 1, and 2 
levels of A must each appear with the 0 level of C and each with the 1 
level of C. We may impose one additional rule, designed to confound 
AB as little as possible. Although we cannot represent all 9 combina- 
tions of AB in a block, it will be well to represent as many as possible. 
Thus we introduce the restriction that any AB combination (e.g., 12) 
must not appear more than once in a block. 

Under these rules only four types of replicate can be made up, as shown 
in table 6.13. 
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Consider, for instance, in how many ways the block containing (000) - 


can be constructed. The first three A levels (reading from the top in 
table 6.13) must be either 0, 1, 2 or 0, 2, 1, so that each A level shall ap- 
pear with the 0 level of C. If 0, 1, 2 is chosen, the remaining three А 


TABLE 6.13 POSSIBLE BLOCKS FOR A 3 ХЗ X 2 FACTORIAL 


Level of A 
B C | Ia. © Ie | Па Hb Пе | Ша Wb IIIc| IVa IVb IVe 
0 2 к 440 2: ея a rs. 2 0 1 
1- 0 2 0 1 0 1 2 0 1 2 1 2 0 
20 Uwe Ду! „2 0 a No alk 0 1 2 
0 1 2 0 1 1 2 0 2 0 1 1 2 0 
1.6 0 1 2 2 0 1 1 2 0 0 2 2 
21 1 2 0 0 1 2 0 1 2 2 0 1 


levels must be either 1, 2, 0 or 2, 0, 1, these being the only sequences that 
make no AB combination appear twice. ` These sequences give blocks 
Ic and IIb respectively. Similarly, the choice of 0, 2, 1 for the first three 
A's leads only to blocks IIIc and IVb. 

In the same way, we find that there are only four possible blocks con- 
taining the treatment combination (001) and four containing (002.) 


The three sets of 4 blocks can be grouped into 4 separate replications 


(1, П, Ш, and IV in table 6.13). — — 
As Yates (6.1) has shown, the plan can. be written in a more con- 
densed form (table 6.14) which exhibits the nature of the confounding. 


TABLE 6.14 CONDENSED FORM OF THE PLAN FOR.A 3 X 3 X 2 CONFOUNDED 
FACTORIAL 


- Blocks 


Io Е Ш ID IL | Ie Шь Ш. | IVe IV». IVe 


Га UM CUN UI: а Ы TUTTA MAD 4. 
14 фала ^2 Дерт ір dw дала Jg A 


For example, in block I, the 3 combinations that appear with the 
zero level of C are (10), (21), and (02). This is the group that forms the 


6.19 NUMERICAL EXAMPLE OF THE 3 X 3 X 2 DESIGN 205 


Iz component of АВ (see section 6.15). It follows that in the first 2 
replicates the 7 component of AB is partially confounded; the J compo- 
nent can be verified to be unconfounded. In replicates III and IV the 
J component is partially confounded, while the 7 component is clear of 
blocks. ABC is partially confounded in all replicates. The relative in- 
formation on AB has been shown to be 74 and that on ABC, %. 


6.19 Numerical Example of the 3 X 3 X 2 Design. Table 6.15 shows 
the plan and analysis of variance for an experiment on the response of 
young tung trees to fertilizers. The results of this experiment have been 
reported by Merrill, Kilby, and Greer (6.17). The general object was to 


TABLE 6.15 DATA ғов 3 X 3 X 2 FACTORIAL EXPERIMENT IN BLOCKS OF 6 UNITS 


Ia Ib Te Ila IIb пе 
100: 80 200: 78 000: 38 200:136 000: 48 100: 89 
210: 86 010: 55 110: 73 010: 56 110: 81 210: 87 
020: 70 120: 82 220: 75 120: 64 220: 90 020: 66 
201: 74 001: 67 101: 78 101: 95 201: 81 001: 91 
011: 82 111: 67 211: 51 211: 76 011: 61 111: 97 
121: 86 221: 57 021: 66 021: 71 121: 65 221: 60 

478 406 381 1,265 498 421 490 1,409 

Ша шь IIc IVa IVb IVe 
100: 86 200: 73 000: 66 200: 88 000: 53 100: 81 
010: 79 110: 76 210: 85 110:107 210: 66 010: 58 
220: 73 020: 97 120:101 020: 70 120: 92 220: 56 
201: 97 001:116 101:117 ; 101: 79 201: 88 001: 90 
111: 79 211: 86 011:106 { 011: 92 111:109 211: 68 
021:113 121: 81 221:102 221: 96 021: 95 121: 67 


527 529 577 1,633 532 503 420 1,455 


Analysis of variance 


Source d.f. 8.8. m.s, 
Replications 3 3,837 1,279 
Blocks within replications 8 2,836 354 
Levels of 8-8-6 їп row (A) 2 1,116 558 
Meals (B) 2 254 127 
Levels of 8-8-6 side-dressed (C) 1 868 868" 
8-8-6 in row X meals (АВ) 4 1,129 282 


8-8-6 in row X 8-8-6 side-dressed (AC) 2 2,995 1,498 ** 
Meals by 8-8-6 side-dressed (BC) 2 424 212 
8-8-6 in row X meals X 8-8-6 side-dressed (ABC) 4 1,016 254 
Error 43 8,909 207 


E] 
В 
g 


Total 
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discover the type of fertilizer application that would stimulate the early 
growth so that the young trees would be ready for transplanting to com- 
mercial orchards by early autumn. The data shown are the mean heights 
in centimeters of the 12 tung trees on each plot. For convenience in 
following the computations, the plot yields follow the same order as in 
table 6.13. Note that factor A appears last in table 6.13 and first in 


table 6.15. 
The three factors were as follows, all dressings being per acre. 
A B с 
8-8-6 fertilizer applied 8-8-6 fertilizer applied 
in the row as side-dressing 
0 None 0 None Q None 
1 200 Ib. 1 Tung meal (650 Ib.) 1 2001b. 
2 400 lb. 2 Cottonseed meal (650 Ib.) 


The side-dressing was placed in a shallow furrow on one side of the 
row about 8 inches from the tung seedlings. Factors A and B were ap- 
plied at planting (March 13), factor C on June 3, and the height measure- 
ments were made on October 23. It should be noted that all three fac- 
tors supply each of the common plant nutrients (N, P, and К). 

In an experiment of this type, one might anticipate that two-factor 
interactions will be present. If, for example, 400 Ib. of fertilizer applied 
in the row supply all the nutrients that the young tree can utilize, plots 
that receive this dressing will show no response to either factor B or C. 
On the other hand, plots that receive the zero level of A may respond to 
the dressings supplied in factors B and C. It would not be wise to con- 
found any two-factor interactions heavily. Since, however, the soils 
available for experiments had been found to be very variable, it was de- 
cided to arrange the experiment in blocks of 6 plots. As we have seen, 
this will partially confound AB, though as the relative information is 
high (7), not much information is lost on this comparison even if the 
incomplete blocks prove ineffective. 

Table 6.16 shows the computation sheet leading to the analysis of 
variance. The steps are as follows. 


i. Calculate the block totals (denoted by Ia, Ib, etc., and shown in 
table 6.15), and the treatment totals (table 6.16). Calculate the total 
s.s. and the blocks s.s. The latter has been divided into two com- 
ponents—replications (3 d.f.) and blocks within replications (8 d.f.). 
This separation enables us to estimate the decrease in error variance due 
to the reduction in block size from 18 to 6 plots. "Тһе analysis of variance 
appears at the foot of table 6.15. 


ii. Form the two three-way totals as shown. From the margins of 
the table for (c; + co), calculate the sum of squares for the main effects 
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TABLE 6.16 COMPUTATION SHEET FOR 3 X 3 X 2 EXPERIMENT, BLOCKS OF 


6 UNITS 
(1) Treatment totals 
co сі 
% by be bo by bz 
40 200 248 303 364 341 345 
a 336 337 339 369 352 299 
аз 375 324 294 340 281 315 
(ii) Two-way tables 
€i + co €i — co 
bo by be Totals bo bi be Totals 
ao 564(1) 589(4) 648(7) 1,801 | +1641) +93(4) --42(7) 299 
а 705(2) 689(5) 638(8) 2,032 | + 33(2) --15(5) —40(8) 8 
аз 715(3) 605(6) 609(9) 1,929 | — 35(3) —43(6) +219) —57 
Totals | 1,984 1,883 1,895 5,762 162 65 23 250 


(ii) From (сі + со) 
Iı = (1) + (5) + (9) = 1,862 
1% = (2) + (6) + (7) = 1,958 
1з = (8) + (4) + (8) = 1,942 


5,762 
20! = 21 + Ia + Ila = 4,700 
2) = 2I; + Io + Ilp = 4,743 
2I; = 2I; + I, + Ile = 4,755 


14,198 


s.s. for AB = 


(4,700)? + (4,743)? +-+--+ (5,101)? 4” (14,198)? + (14,612)? ш 


(1) + (6) + (8) = 1,807 
(2) + (4) + (9) = 1,903 
(3) + (5) + (7) = 2,052 


5,762 
2J,' = 27; + III, +1Va = 4,673 
27.’ = 275 + Шь + ТУь = 4,838 
27у = 275 + Ш, + 1У. = 5,101 


Л 
J2 
Js 


па 


14,612 


1,129 


84 
(іу) From (сі — со) 
Ву = +200, Re = +32, Ез = +18; 


where, e.g., 


№ = (1) + (5) + (9), - 


252 


Sı = +81, Se = +147, Ss = +22 
+, Sa = (8) + (5) + (7) 


2Ry = 28: — Io + Ie + Il, — П, = +306 
28 = 28 + Ia — I; — Wa + Ue = +153 
283 = 2R; — + + Ha — Ш = + 41 


+500 


251 = 28, — III; + III, + IV; — ІУ, = 4-293 
25)! = 28s + Ша — III, — IVa + IV, = +132 
2583 = 253 — Ша + Шь + IVa — IVe = + 75 


(+306)? + (+153)? +- -- + (+75)? _ (+500)? 


+500 


в.в. for ABC = 


60 


gp = 1018 
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of A and В. From the total of the table for (с1 — со), we calculate the 
sum of squares for the main effect of С: this is (250)?/72, since there are 
72 plots. Similarly, the margins of the table provide the sum of squares 
for the AC and BC interactions, which being unconfounded are obtained 
in the usual way. 


iii. These steps lead to the sum of squares for AB. First calculate the 
I and J components of AB, which are obtained by summation from the 
(сі + со) table in step ii, as shown in table 6.16 (iii). Then apply the ad- 
justments for block effects. Since 11, for instance, does not appear in 
blocks Га or Па, the adjustments involve the totals for these two blocks. 
The adjustments lead to the quantities 271”, ete., from which the sum of 
squares for AB follows as shown. 


iv. These steps lead to the sum of squares for ABC. From the table 
for (сі — co), step ii, form totals В; and S; which are calculated by the 
same rules as for the Гв and J’s, respectively. These totals are then ad- 
justed for block effects, yielding quantities 2R,’, 28;', etc., from which 
the ABC s.s. is easily obtained. The error s.s. is then computed by sub- 
traction. 


v. Summary and presentation of results. Consider first the con- 
clusions indicated by the analysis of variance. The only significant mean 
square for a main effect is that for C (side-dressing). The AC interaction 
is significant at the 1% level, while the mean square for A, though not 
significant, is substantially above the error m.s. Neither B (meals) nor 
any of its interactions approaches significance. These results suggest 
that the AC two-way table should be examined. 

Since table 6.17 contains no confounded effects, the means per plot 
are found simply by addition and division from table 6.16, (i). It is 


TABLE 6.17 АС two-way TABLE 


Mean heights (centimeters) + 4.15 


45a Side dressing 
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evident that there are large and significant responses to the 200 Ib. dress- 
ings, both when applied in the row and as side-dressing. The means, 
84.3 for the row application and 87.5 for the side-dressing, do not differ 
significantly, so that the applications appear to have been about equally 
effective. There is no further increase to the 400 Ib. row dressing, or to 
mixtures containing both types of dressing. Thus the large AC inter- 
action arises primarily because each dressing is effective only in the 
absence of the other. 

The analysis of variance would indicate that uo definite effects of the 
meals (B) can be established. As an exercise, the AB two-way table, 
which involves partially confounded effects, will be examined. The 
means must be adjusted for block effects. This could be done by first 
calculating a mean for each incomplete block, adjusted for treatment 
effects, and then adjusting each AB mean for the 4 blocks in which it 
lies. It is slightly quicker to perform the adjustments by means of the 
I and J effects, as follows. 

In table 6.16, (iii), divide the quantities 271”, etc., by 42, and calculate 
the deviations from their mean. The quantities 2,717, etc., are treated 
similarly. The same procedure is carried out for the Гв and J’s, except 
that the divisor is 24. This gives 
iY =-08 = 02, у= 05 jy =—47, foo =-08, fo = 54 

y= -24, й= 16, i= 0.9 A= —-47, j= —0.7, һ- 55 
ôi = +1.6, ӧ = —14, 58 = —0.4 81 = 0.0, X» = —0.1, 5) = —0.1 

The unadjusted mean of (aobo) is 70.5. From table 6.16, (ii) and (iii), 

we see that (aobo) belongs to Iı and Jı. Hence its adjusted value is 


(aobo)' = (aobo) + ёй + д = 70.5 + 1.6 + 0.0 = 72.1 
By the same principle, the adjustment to (арб), for instance, is 
(5% + ёл). The complete two-way table is shown below. 


TABLE 6.18 ` АВ TWO-WAY TABLE, ADJUSTED FOR BLOCK EFFECTS 


Mean heights (centimeters) 
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The standard error of a single figure in the table is У 207/8, or 5.09, 
since each figure is the mean of 8 plots. This standard error applies to 
t-tests of unconfounded effects; i.e., of the main effects of A and В. For 
t-tests of components of the AB interaction, the figure must be multiplied 
by V 8/7, or 1.069, the value 8/7 being the inverse of the relative effi- 
ciency on АВ. 

For t-tests of quantities that are a mixture of main effects and inter- 
actions, the multiplier lies between 1.069 and 1. It will usually be 
sufficiently accurate to use the multiplier 1.069. If desired, the correct 
standard error may be obtained by means of a rule which will be given 
without proof. The quantity whose standard error is wanted is expressed 
in terms of components of main effects and interactions. Suppose that 
we wish to test the response to cottonseed when no row fertilizer is 
applied. The response is (79.5 — 72.1), or 7.4 сш. The approximate 
standard error is 5.09 X 1.069 X 4/2, or 7.69. To obtain the correct 
value, we write symbolically, 

(b2 — bo)(ao + ai + аз) (6 — bo)(2ao — ay — аз) 

ЖҮ; 
8 3 

If all means are unadjusted means, this is an algebraic identity. The 
first term on the right is a component of the main effect of B, while the 
second is a component of the AB interaction. Further, by a property of 
least squares, the equation remains valid if means adjusted for block ef- 
fects are substituted for unadjusted means. By the rule for the variance 
of а linear function, the variance of the first term on the right is 602/9, 
where c? is the variance of a single figure. By the same rule, the variance 


of the second term is 
12\ /8 
(6^ 
9/ \7 


the factor 8/7 being introduced because the second term is а component 
of AB. The two terms are orthogonal, so that the estimated standard 
error of (bz — Бо)ао is 


5.09 V/$ + (52)(8) = 5.092.190 = 7.52 


Adjustment of the 18 individual treatment means is more tedious. In 
table 6.16, (iv), we divide the quantities 2Е1', ete., by 30, and the quan- 
tities Ry, etc., by 24, thereafter taking deviations from the mean of each 
trio. This gives 


(by — bo)ao = 


т = +46, mn'--—05, т/--42  s'-442, s -—12, зу = —81 
n= +48, п=-22 n-—27 4--01, s-426, s--26 
ёз = —02, ӛз = 41.7, ör = —15 ды = 443, ös = —3.8, ñ= -0.5 
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In this experiment there is little point in adjusting all 18 treatment 
means. However, the unadjusted treatment totals, table 6.16, (i), sug- 
gest that there might be an effect of the meals (factor B) when applied 
alone. Hence it may be of interest to test the difference between the 
adjusted means of agbecy and аобосо. 

From table 6.16, (iii) and (iv), we see that аобосо belongs to й, ji, 71, 
and sı. Since the unadjusted mean is 50.0, the adjusted mean is 


50.0 + ё + 67, — dry — às, = 
50.0 + 1.6 + 0.0 + 0.2 — 4.3 = 47.5 cm. 


Note that the terms іп àr and às carry a minus sign whenever the treat- 
ment is at the со level, and a plus sign when the treatment is at the сү 
level. For аобәсо, which belongs to 2», ja, тә, and эз, the adjusted mean 
will be found to be 73.1 em. 

In order to obtain the standard error of the difference (73.1 — 47.5) 
= 25.6, this must be expressed in terms of components of main effects 
and interactions. This requires some practice, but the method is not 
hard to master. Consider the following table of values of (bo — bo), 
averaged. over the 4 replicates. 


(b, — bo) 
40 a а; | Total 


[7] d € f 1 
сі 9 h k m 


Total | т o ? 4 


The quantity whose standard error we wish is 4. Now it is easy to 
verify that 
6d=q+(1—m)+ 2n—0~p) +{(2d—e-f) — (00—һ—Ю)] 
components of B BC AB ABC 


As before, this is an identity for the unadjusted means and remains valid 
for adjusted means. Hence by the same rule as before 


а? 8 8 
36074 = SN [2 4-12 4- 24 e + 24 cy iim 22.4607 


where c? is the estimated variance per plot. The extra factor 8/7 is used 
for the AB component and the factor 8/5 for the ABC component, for 
which the relative information is 5/8. Consequently, the estimated 
standard error of d is 

22.46 X 207 


36 


= 11.4 em. 
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Tt follows that the ¢-value for the response to cottonseed when applied 
alone is 25.6/11.4, or 2.25, with 43 d.f., which is significant at the 5% 
level. 

These results illustrate a feature that is common to the mixed series of 
confounded factorials. If the examination of the results is directed 
mostly at simple rather than factorial effects, the analysis is likely to be 
time-consuming because of the adjustments needed to remove block ef- 
fects and to construct standard errors by the method given above. It 
will also be apparent that considerable experience in the analysis of 
variance, or access to expert advice on this topic, is advisable. 


6.2 The Use of Confounded Designs 


6.21 General Recommendations. In order to make efficient use of 
confounding, it is necessary to consider whether the advantage is likely 
to outweigh the disadvantages. 

The advantage comes from the reduction in the experimental error 
by the use of a block which is more homogeneous, or which can be sub- 
jected to a more uniform technique, than the complete replicate. With- 
out some idea of the amount of this reduction, a realistic decision cannot 
be made on the question of confounding. If uniformity data have been 
collected, the experimenter may compare the variability within the in- 
complete blocks with the variability within replicates. In addition, 
from the results of an experiment in which confounding has been em- 
ployed, it is usually possible to estimate what the experimental error 
would have been if confounding had not been used. The method of cal- 
culation, which was first outlined by Yates (6.5, р. 215), is given in зес- 
tion 6.35. In many types of research the reduction in error from con- 
founding varies greatly from experiment to experiment. In such cases 
little reliance can be placed on the results of a single trial; a summary of 
a moderately large group of experiments is necessary. 

The disadvantages of confounding consist of (1) the reduction in 
replication on the confounded treatment comparisons, and (2) in some 
cases, a greater complexity in the calculations. With regard to the first, 
the experimenter should take into account the importance of the inter- 
actions that must be confounded and the extent to which they are con- 
founded. No interaction should be completely confounded unless there 
is good reason to believe, either from previous experience or from the 
nature of the factors to be tested, that the interaction will be negligible. 

As an illustration of the type of reasoning that might be employed in 
reaching a decision, consider the 3 X 3 x 2 design discussed in the pre- 
vious section. The incomplete blocks contain 6 units, with 7/8 relative 
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information on AB and 5/8 relative information on ABC. ABC is ex- 
pected to be negligible; AB, however, is of sufficient interest so that con- 
founding is undesirable if it involves any loss of precision for this com- 
parison. The issue resolves itself into the question: is the error variance 
per unit with blocks of 6 units likely to be less than 7/8 of the error vari- 
ance with blocks of 18 units? If so, the AB interaction is estimated more 
precisely from the confounded design than from randomized blocks, be- 
cause the reduction in variance per unit more than compensates for the 
loss of replication due to confounding. Since the relative information is 
useful for this type of argument, it is given in table 6.19 for all the plans 
presented at the end of this chapter. 

The increase in the amount of computation varies both with the type 
of design and with the type of analysis needed to summarize the results. 
Where all the confounding is complete, the statistical analysis is prac- 
tically the same аз with randomized blocks. With all factors at two or 
three levels (1.е., in the 2” and 3” series), the extra computations are not 
formidable even when partial confounding is used; they are much more 
troublesome when the number of levels is not the same for all factors, 
as in the 3 X 2? and 3 X 2? designs. The extra work consists in the cal- 
culation of adjustments to the partially confounded comparisons so that 
their sums of squares in the analysis of variance are freed from block 
effects. If it is desired to present tables of mean yields which involve 
confounded comparisons, the entries in the tables must be adjusted so 
as to remove block effects. This point should be remembered, especially 
with designs where two-factor interactions are confounded. If t-tests of 
simple effects are needed, special computations must be made to obtain 
the standard errors for these tests. Moreover, if part of the data is miss- 
ing, the analysis tends to become much more complicated than with 
randomized blocks. 

The experimenter who has not previously used confounding and who 
does not have access to expert statistical advice is advised to confine him- 
self at first to the simplest types of confounding. He should make sure 
that he understands the method of analysis before the experiment 18 соп- 
ducted. After the experiment is completed, he should investigate the in- 
crease in precision that has been achieved by confounding, as a guide to 
the future use of the technique. 

Yates (6.5) has pointed out that in certain circumstances a confounded 
design might give rise to a misleading interpretation of the results. This 
can be illustrated by means of the 2? factorial in table 6.1 (р. 183). This 
has three replicates, with ABC completely confounded in each replicate. 
Suppose that the response to A is much larger than any other effects, 
and that this response varies greatly from block to block. That is, the 
interaction of A with blocks is much larger than the interactions of the 
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other factors with blocks. Now the factorial effect total for BC may be 
written 


BC = (abc) + (bc) + (a) + (1) - (ab) — (ac) - (b) — (0 
= (аһ) + (а) - (b) — (с)] — (а) + (ac) - (bc) - (1)] 
= response to A in blocks 1, 3, 5 — response to A in blocks 2, 4, 6 


The point is that the factorial effect of BC is a component of the inter- 
action of A with blocks. When tested against the error m.s. (12 d.f.) 
this response might be significant, not because there is an effect of BC, 
but because A has large interactions with blocks. 

Another way of describing the source of the trouble is to say that the 
error m.s. is not homogeneous, in that the interaction of A with blocks 
is larger than the interactions of other treatments. If this were sus- 
pected, the danger of a misleading interpretation could be avoided by 
subdividing the error m.s. We can isolate 4 d.f. (two from blocks 1, 3, 
and 5 and two from blocks 2, 4, and 6) that represent the interaction of 
A with blocks. The mean square for these four could be used to test A 
and BC, while the remaining factorial effects would be tested by the other 
8 d.f. for error. This device can usually be applied in the 2” series, though 
generally not in the mixed series. Further, the heterogeneity of the 
error variance may not be suspected. 

How frequently this danger arises will depend on the nature of the 
experimentation. Investigations by Yates (6.5) and Kempthorne (6.9) 
suggest that there is little danger with fertilizer experiments on English 
farm crops. In lines of research where errors have often been found to be 
heterogeneous, one should be on the alert for this effect. 

In some experiments the use of confounding is practically unavoidable. 
For instance, an experiment on methods of manufacturing ice cream, 
planned at North Carolina State College, contained 3 factors each at 3 
levels. It was desired to discover whether ordinary consumers could 
detect differences in the palatability of the 27 different types of ice cream. 
From previous experience it was believed that a taster (who was not an 
expert judge) could compare at most 3 ice creams at any single test. 
Moreover, it did not seem feasible to ask the tasters to grade the pal- 
atabilities on any kind of standard scale: they could merely be asked to 
rank the 3 ice creams in order of preference, if they had any preferences. 
Consequently, a design for a 3° factorial in blocks of 3 units was re- 
quired, even though this involved confounding of two-factor interactions. 
The design adopted also employed partial confounding of the main ef- 
fects, so as to obtain approximately equal accuracy on main effects and 
two-factor interactions. 
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TABLE 6.19 INDEX TO PLANS OF FACTORIAL EXPERIMENTS CONFOUNDED IN 
RANDOMIZED INCOMPLETE BLOCKS 


а. Designs with which any number of replicates may be used 


Num- Num- 
ber of Бег of 
treat- units per 
Type ments block Interactions confounded in a single replicate * Plan 


2 8 4 АВС 6.1 
2 16 8 ABCD 6.2 
2 16 4 AB, ACD, BCD 6.4 
25 32 8 АВС, ADE, BCDE 6.5 
ж 64 16 ABCD, АВЕР, CDEF 6.3 
25 64 8 ABC, CDE, ADF, ВЕР, ABDE, BCDF, ACEF 6.6 
38 27 9 АВС (34) 6.7 
8% 81 9 ABC, ABD, ACD, BCD, all (34) 6.8 
4 16 4 АВ (24) 6.12 
4x2 16 8 ABC (24) 6.13 


*The fractions in parentheses give the relative information on the comparisons 
which are confounded. Where no fraction is given, the comparison is completely 
confounded. 

b. Balanced designs 


Number of 
replicates 
Number Units fora Interactions confounded and 
of treat- рег balanced relative information (in paren- 
Type ments block design theses) * Plan 
z 16 4 бп f All two-factor (56); all three-fac- 
tor (14) 6.4 
25 32 8 5n All three-factor (44); all four-fac- 
tor (4%) 6.5 
2 64 8 10n All three-factor (44); all four-fac- 
tor (44) 6.6 
33 27 9 4n All three-factor (34) 6.7 
8* 81 9 4n All three-factor (34) 6.8 
3x2 12 6 3n 1 BC (8), ABC (56) 6.9 
8x2 24 6 3nł ВС, BD, CD, all (86); ABC, ABD, 
ACD, all (56) 6.10 
3х2 18 6 4n AB (1), ABC (5%) 6.11 
4? 16 4 3n AB (24) 6.12 
4х2 16 8 3n ABC (24) 6.13 
4хзх2 24 12 9n { AC (2661), АВС (2341) 6.14 


* The factors ABC >>. are read from the left; thus the BC interaction in a 3 X 2? 
design is the interaction between the 2 factors at 2 levels. 

t The symbol “бл” denotes that the number of replicates should be а multiple of 6. 

1In these cases only the balanced design is recommended. 
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6.22 Index to Experimental Plans. The opportunities for confound- 
ing vary with the number of factors and with the number of levels of each 
factor. The plans given at the end of this chapter are those that are 
likely to be most frequently used. 

Table 6.19, which forms an index to the plans, is divided into two parts. 
The first part shows a group of designs in which any number of replicates 
may be used. In successive replicates, the experimenter may either (i) 
repeat the first replicate (with a new randomization), in which case the 
same comparisons are confounded in all replicates, or (ii) change the 
grouping into blocks in successive replicates, so as to spread the соп- 
founding over a greater number of comparisons. In the plans, the fac- 
torial effects that are confounded are shown separately for each replica- 
tion. Consequently, the reader can readily see what possibilities are 
open in this direction. Where confounding can be restricted to a high- 
order interaction, as in the 2* design in blocks of 8 units, the first pro- 
cedure is preferable. On the other hand, with the 2* design in blocks of 
4 units, where one two-factor interaction must be confounded in every 
replicate, the confounding should be changed in successive replications, 
unless it is certain that one of the two-factor interactions will be neg- 
ligible. More detailed recommendations for the individual designs are 
given in section 6.3. 

The second part, table 6.19b, shows a group of balanced designs, in 
which all interactions of the same order and the same type are con- 
founded to the same extent. Thus in the 3 X 2 X 2 X 2 (or 3 x 23) 
design, all first-order interactions between the factors which have 2 levels 
are equally confounded, the relative information being 8/9. The first- 
order interactions between the factor at 3 levels and any factor at 2 levels 
are unconfounded. Balanced designs are useful in cases where all inter- 
actions of the same order and type are of equal interest, so that it is un- 
desirable to confound some more fully than others. Also, the computa- 
tions are usually simpler. For this reason, only the balanced designs 
should be used in certain cases, as indicated in table 6.19. On the other 
hand, with balanced designs the available numbers of replications are 
severely limited. 

Table 6.20 shows a number of designs for which plans are given in the 
references cited. "These designs are mainly for mixed series, and will not 
be discussed here. The most recent addition to this list consists of à 
series of designs by Binet её al. (6.12), which may be used either in single 
replication (see section 6.24) or in more than one replication. These de- 
Signs confound linear and quadratic components as little as possible. 
Table 6.202 presents selected replicates which appear to be most useful. 
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6.23 Randomization. The procedure is the same for all designs. 

1. Arrange the positions of the blocks at random within each replica- 
tion, using a new randomization for each replication. 

2. Allot the treatments in any block at random to the units in the 
block, using a new randomization for each block. 


TABLE 6.20 COoNFOUNDED DESIGNS FOR OTHER FACTOKIAL EXPERIMENTS 


Units 
рег Number of 
Type block replicates Interactions confounded Reference 
х2 12 2n AB (7$, ABCD (5) 6.6 
3x2 18 2n ABC, ABCD 6.6 
33 x 2 6 2n AB, AC, BC (7%), ABD, ACD, BCD, АВС 6.1 
4х3? 12 2n BC (7$, ABC 6.6 
2х2 16 Апу АВС 6.6 
4x3 12 3n АВ (2947), ABC 6.6 
43 16 Апу АВС 6.7 
4х2 8 3n ABC, ABD, ACD 6.6 
4 16 Апу АВС, АВР, АСР, ВСр 6.7 
p 5 4n AB (34) 6.8 
5x2 10 ön BC (2445), ABC 6.6 
БЫ 25 Апу АВС 6.8 


TABLE 6.200 DEGREE CONFOUNDING DESIGNS IN SINGLE REPLICATES * 


Units Relative information on 
per confounded first and 
Type block second degree effects Table* Plans 
3х2 3 Bi (86) 6 1 
3x3 4 and 5 Аз (36), Bs (96) 59 3 
3х2 9 С, (8981) 9 E 
“Хх” 12 A,B, (3342) 15 1-6 
12 АС: (143) 15 9and10 
18 Сар (891) 13 T 
4x2 4 None confounded 20 2 
4x3 6 А» (36) 21 3 
4? 4 A3B1 (2345) 29 2 
8 None confounded 27 land 2 
5х2 5 Bi (2545) 36 1 ара 3 
5x3 7 and 8 Аз and В; (2.9690) 61 РА 
5x4 10 В, (2445) 38 — land2 
5x3 15 В\С, (4360) 40 14. 
5x3x2 15 С: (299%) 42 7 and 8 
5? 121413 А:ап4 Вг (>99%) 62 3 


ж See reference Binet et al. (6.1а). А: is a linear (first degree) component, while 
Аз or A;B; are quadratic (second degree) components. 
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TABLE 6.20a (Continued) 


Units Relative information on 
per confounded first and 
Type block second degree effects Table Plans 

6х2 6 A,B; (>99%) 45 T 
6x3 9 41 (>99%) 46 1 
6x4 12 None confounded 47 2and 5 
6x5 15 41 (>99%) 49 H 

6? 18 41В1 (>99%) 59 1 
6х 22 12 None confounded 51 land 2 
6х3х2 18 A101 (>99%) 52 9 
7х2 7 By (4849) 55 ^ 
7x4 14 В» (4849) 56 

(Жары 14 B,C) (48%) 57 

7X3 10 and 11 Аз and B» (>98%) 63 

7 Х5 17 апа 18 Аз and В» (>99%) 64 


6.24 Experiments іп Single Replication. In most of the plans at the 
end of this chapter, the total number of treatment combinations is large. 
This fact limits the number of replications that can be employed. Some- 
times the resources are sufficient for only a single replication unless the 
experimenter omits some factors which he originally intended to include, 

With only one replication, we cannot derive an estimate of error from 
the interactions of treatments with blocks. If, however, certain high- 
order interactions are negligible, their mean squares in the analysis of 
variance will behave exactly like components of the error m.s., and there- 
fore can be used to provide an estimate of error. In the 2” series, the 
smallest design which provides an adequate number of error d.f, is the 25 
in blocks of 8 units (plan 6.5). In replicate 1 of this plan, ABC, ADE, 
and BCDE are completely confounded. If the remaining three-, four-, 
and five-facior interactions furnish the estimate of error, the analysis of 
variance appears as follows, 


d.f. 
Blocks 3 
Main effects 5 
Two-factor interactions 10 


Error (from high-order interactions) 13 


Total 31 
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6.3). It may sometimes be wise to deviate from these recommendations. 
For instance, in the 2° example above, the main effects of A, C, and D 
and the AC and AD interactions might all turn cut to be rather large. 
This would suggest that the ACD interaction may not be negligible, and 
that it should not be included in the interactions used aserror. A decision 
of this type should be made before examining ACD itself. The practice 
of examining the high-order interactions and using those that are small 
as error leads to a serious underestimation of the true error variance. 

The estimation of error from high-order interactions is, of course, open 
to criticism. If some of these interactions happen to be large, the error 
m.s. that is used will overestimate the true error variance and the fact 
that the interactions are large will not be discovered. Where a consid- 
erable series of experiments of the same general type is being conducted, 
some safeguard is obtained both by examining the high-order interactions 
in experiments that are replicated and by watching the two-factor inter- 
actions in experiments with single replication. If most two-factor inter- 
actions are small, it seems very unlikely (though still possible) that inter- 
actions of higher order will be large. If many two-factor interactions 
are found to be large, this suggests that some three-factor interactions 
may also be substantial. See Yates (6.5) and Cornish (6.10). 


6.3 Notes on the Plans and Statistical Analysis 


6.31 2" Series. In plans 6.1 to 6.6, one level of a factor is denoted 
by the corresponding letter; the other level by the absence of the letter. 
With designs that are not balanced, care must be taken to allocate the 
letters a, b.c, . . . to the factors in the experiment so that the interactions 
which are confounded are those that the experimenter has decided to 
confound. For instance, the first replication of plan 6.4 (2* design in 
blocks of 4) confounds the interaction AB, but no other two-factor inter- 
action. The factors labelled A and B should be those whose interaction 
is of least interest. 

The statistical analysis follows the same general method for all 2" de- 
signs. The total s.s. and the sum of squares among blocks are obtained 
by the usual methods. The factorial effects may be computed by one of 
the methods outlined in section 6.14. When this calculation has been 
completed, the sums of squares are added for all unconfounded compar- 
isons. Let this total be Sr. If all the confounding is complete, т is the 
treatment s.s., since the effects that are confounded appear already in 
the block s.s. For each comparison which is partially confounded, the 
sum of squares is calculated separately from those replications in which the 
comparison is unconfounded. The replications to be used are clear from 
the plans, which show the factorial effects that are confounded in each 
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replication. The total of these sums of squares is added to Sr to give 
the treatment s.s. The error s.s. is found by subtraction. 

23 factorial, blocks of 4 units. Plan 6.1, which confounds ABC com- 
pletely, may be used for every replication. For a worked example, see 
reference (6.1), p. 19. 

Alternatively, with 4, or a multiple of 4, replicates, we may confound 
partially all two-factor interactions and the three-factor interaction, the 
relative information on each comparison being 3/4. The plan and sta- 
tistical analysis are described in reference (6.1), p. 21. 

2* factorial, blocks of 8 units, Plan 6.2 should be used for every replica- 
tion, ABCD being completely confounded. 

Note that the treatment s.s. contains only 14 d.f. since the single de- 
gree of freedom for ABCD is included in the blocks. 

2* factorial, blocks of 4 units. Plan 6.4 gives a balanced design with 6 
replications. If a smaller number of replicates is used, the replicates 
chosen depend on the interactions which the experimenter decides to 
confound. Since one two-factor interaction is confounded in each rep- 
licate, it will usually be preferable to confound different comparisons in 
successive replications. For instance, if AB and CD are to be con- 
founded, replicates 1 and 6 should be selected. This arrangement gives 
1/2 relative information on AB, CD and the three-factor interactions 
ACD, BCD, ABC, ABD, all other comparisons being clear of blocks. 

With a 2* experiment, the investigator may be uncertain whether to 
confound in blocks of 8 units or in blocks of 4 units. The former design 
is recommended unless (a) the interactions which are confounded with 
blocks of 4 units are known to be small or (b) the block of 4 units is much 
more homogeneous than the block of 8 units. The latter condition may 
apply where the experimental material is extremely variable, or falls 
naturally into groups of 4 units. 

25 factorial, blocks of 8 units. The balanced design (plan 6.5) requires 
5 replicates. When fewer than 5 replicates are used, it may be satis- 
factory to confound completely two three-factor interactions and one 
four-factor interaction. In this case replication 1 is repeated the re- 
quired number of times. 

If some information is desired on all treatment comparisons, replica- 
tions 1, 2, --- may be used up to the proposed number. With replications 
1 and 2, for instance, the interactions ABC, ADE, ABD, BCE, BCDE, 
and ACDE are partially confounded, each with relative information 1 /2. 

The statistical analysis is described in reference (6.1), p. 27, for the 
case in which the experiment contains only a single replication. 

2° factorial, blocks of 16 units. Plan 6.3, which shows a single replica- 
tion with ABCD, ABEF, and CDEF confounded, may be used for all 
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replications. The 3 interactions above are completely confounded, and 
are omitted from the treatment s.s. in the analysis of variance. 

If there is only 1 replication, an estimate of error may be obtained 
from the unconfounded four-, five-, and six-factor interactions. These 
comparisons supply 22 d.f., of which three are confounded with blocks, 
leaving 19 for the estimate of error. The separation of the 63 d.f. is then 
as follows: 


df. 
Blocks 3 
Main effects 6 
Two-factor interactions 15 
Three-factor interactions 20 
Error (from high-order interactions) 19 


2° factorial, blocks of 8 units. In plan 6.6, the size of block is reduced 
from 64 to 8 units without confounding any two-factor interactions. 
Four of the 20 three-factor interactions and 3 of the 15 four-factor inter- 
actions are confounded in each replication. The balanced design requires 
10 replications, 1.е., 640 experimental units, and necessitates a large ex- 
periment. 

With a single replication, the 63 d.f. in the analysis of variance sub- 
divide as follows: 


d.f. 
Blocks 7 
Main effects 6 
Two-factor interactions 15 
Three-factor interactions 16 
Error (from high-order interactions) 19 


The simplest procedure when there is more than one replication is to 
confound completely a single set of 7 factorial effects by the repeated use 
of replication 1. For partial confounding, the successive replicates in 
plan 6.6 may be taken. In the first 4 replicates no three-factor inter- 
action is confounded more than once. 


6.32 3” Series. From plan 6.7 onwards, the levels or variations of 
each factor are denoted by the figures 0, 1, 2, >>>, while the factors are 
read from left to right. The treatment 102 implies the middle level of 
the first factor, the lowest level of the second factor, and the highest 
level of the third factor. In an experiment the numbers 0, 1, 2, --- may 
be assigned to the levels or variations of a factor according to any con- 
venient scheme. 

The testing of three instead of two variations of each factor makes 
possible a more thorough evaluation of the effects of the factors. In 
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particular, the 3° factorial experiment has been much used in investiga- 
tions of the responses to fertilizers, since the effects of each of 3 different 
plant nutrients may be explored at 3 levels of application, which provide 
some information on the shape of the response-curve. 

83 factorial, blocks of 9 units. In plan 6.7, two of the 8 d.f. for АВС are 
completely confounded in each replication, so that the balanced design 
requires 4 replicates and provides 3/4 relative information on ABC. 
Unless the three-factor interactions are known to be negligible, in which 
case replication I may be used for all replicates of the experiment, it is 
preferable to select the required number of replications in succession 
from the plan. For example, an experiment with 3 replicates gives 2/3 
relative information on 6 of the 8 interaction d.f., and full replication on 
the remaining 2 d.f. As usual, the rule in the analysis of variance is to 
estimate confounded comparisons from those replications in which they 
are not confounded. An example of the analysis with 4 replicates has 
already been given (section 6.17). 

For an example of the analysis with a single replicate, see reference 
(6.1), р. 53. If the three-factor interactions are used as error, the separa- 
tion of the degrees of freedom is as shown on the left in table 6.25. 


TABLE 6.25 SUBDIVISION OF THE DEGREES OF FREEDOM IN A 3? EXPERIMENT 
WITH SINGLE REPLICATION 


I. Three-factor interactions П. For experiments where all 

used as error effects are approximately 
linear 

d.f. d.f. 

Blocks 2 Blocks 2 

Main effects 6 Main effects 6 

Two-factor interactions 12 AB: linear X linear 1 

Error 6 AC: linear X linear 1 

- BC: linear Х linear 1 

Total 26 Error 15 

Total 26 


The number of error d.f., 6, is rather small. There is an alternative 
scheme for experiments where the effects of the factors are linear, or 
nearly so, Under these circumstances, the only components of the two- 
factor interactions that are likely to be substantial are the interactions of 
the linear responses to the factors. Hence, nine of the 12 d.f. for two- 
factor interactions are added to the material used for an estimate of error, 
as shown in table 6.25, II. If in addition the linear components of 
the two-factor interaction are large, a further variant is to remove from 
the error the ABC: linear X linear X linear term. This leaves 14 d.f. 
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for error. The calculation of these components should be fairly simple 
with the help of the example worked in section 6.17. 

3* factorial, blocks of 9 units. With 4 factors each at 3 levels (plan 6.8) 
the block size is reduced from 81 to 9 units by confounding three-factor 
interactions. There are 4 three-factor interactions: ABC, ABD, ACD, 
and BCD, each with 8 d.f. Two components from each set of 8 are con- 
founded in any replication. If the experiment has more than 1 replicate, 
partial confounding is generally preferable, the required number of rep- 
licates being selected from the balanced set. The balanced design, with 
4 replicates, gives threefold replication on all three-factor interactions. 

Statistical analysis with a single replication. An analysis may be cal- 
culated without difficulty from the following partition of the degrees of 
freedom. 


d.f. 
Blocks 8 
Main effects 8 
‘Two-factor interactions 24 


Error (from three- and four-factor interactions) 40 


All sums of squares are calculated in the usual way. For the main effects 
and two-factor interactions, six 3 X 3 tables are required. The error, 
which is found by subtraction, contains 24 d.f. representing uncon- 
founded three-factor interactions and the 16 d.f. for the four-factor inter- 
actions. 

This analysis gives no test of the three-factor interactions. If a test 
of ABC, for example, is wanted, the first step is to calculate in the usual 
way the sum of squares for the 8 44. From this we must subtract the 
sum of squares for the 2 d.f. that are confounded with blocks. 

In plan 6.8, replication I, the 2 d.f. are denoted by ABC I, where the 
numeral I refers to replication I of plan 6.7. If we ignore D, it will 
be found that blocks 1, 6, and 8 contain the same set of 9 treatments as 
block Ia in plan 6.7. Calculate the total yield of these 3 blocks (27 
units). Similarly, we calculate the total yield of blocks 3, 5, and 7, which 
correspond to block Ib in plan 6.7, and of blocks 2, 4, and 9, which cor- 
respond to block Ic. The sum of squares of deviations of the 3 totals, 
divided by 27, is subtracted from the sum of squares for the 8 d.f. The 
resulting 6 d.f. are removed from the error, which now has 34 d.f. 

The reader may verify that the 2 confounded degrees of freedom for 
BCD (i.e., BCD II) are obtained from the totals of blocks 1, 2, 3, blocks 
4, 5, 6, and blocks 7, 8, 9. In this case A is ignored when plan 6.8 is com- 
pared with plan 6.7. 

Statistical analysis with more than one replication. И all three-factor 
interactions are presumed to be negligible, they may be combined with 
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the error. The sums of squares for blocks, main effects, and two-factor 
interactions are found in the usual way, while the remainder constitutes 
the error. 

All 16 components, each with 2d.f., of the three-factor interactions 
may, however, be calculated and tested. Partially confounded compari- 
sons are taken only from replicates in which they are unconfounded. 
For example, if there are two replications of plan 6.8, all the data are 
used in the calculation of components ABC III and ABC IV, which are 
unconfounded. For ABC II we use only the data from the first replicate 
of the experiment, and for ABC I only the data from the second replicate. 


6.33 Mixed Series. 3 х 27 factorial, blocks of 6 units. Only the bal- 
anced plan (6.9) should be used; this may be repeated if extra replica- 
tions are wanted. Thus the number of replicates must be 3, or some 
multiple of 3. Although the BC interaction is partially confounded, the 
relative information is high (8/9). 

The statistical analysis is described, with a numerical example, in 
reference (6.1), p. 58. 

8° X 2 factorial, blocks of 6 units. In plan 6.11, AB and ABC are 
partially confounded. Either 2 or 4 replicates may be used, though 4 
are required for complete balance. A numerical example for 4 replicates 
was given in section 6.19. 

With 2 replicates (say I and II in plan 6.11) the J and В components 
are partially confounded, while the J and S components are uncon- 
founded. The J and R totals are adjusted by the same formulae as is 
table 0.10, (iii) and (iv). The J and S totals need no adjustment. The 
interaction s.s. in the analysis of variance are computed as follows. 


AB(I) = 35101)? + (21)? + (21,)?] — 145121, + 21, + 2137]? 
АВ() = т, + Jo? Ја] l Л + Jal? 
ABC(R) = 112879 + (2R)? + (98,74 
= 3561281 + 2R? + 28:2 
ABC(S) = 718° + 82? + S37] — 518, + So + Ss]? 


The AB two-way table and the table of individual treatment means 
are adjusted in the same way as with four replicates, The quantities 27’, 
2R’ are divided by 18 and 6, respectively, to obtain 7 and 7’. For i and 
т, we divide J and В each by 12. After taking deviations from the mean 
in each case, we calculate ôi and ôr. Thereafter the adjustment proceeds 
exactly as with 4 replicates, being somewhat simpler since J and S com- 
ponents do not enter. 
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8 X 2? factorial, blocks of 6 units. In plan 6.10 the interactions BC, 
BD, and CD, between pairs of factors that occur at 2 levels, are partially 
confounded with 8/9 relative information, while 5/9 relative information 
is retained on the interactions АВС, ABD, and ACD. Only the balanced 
design which requires 3 replicates is recommended. 

The following computations lead to the sum of squares due to partially 
confounded effects. Let 


g = la + h a e lajn ph datil 41. — Ig 
g” = la — lo — le + Ia 


where I, is the total of block Та, ete. Similar definitions hold for до, * +, 
gs” in the other replications. Then find 


30 = 3[CD] + gı + 92 + gs; 3Q' = 3[BD] + gr’ + ge! + 995 
30" = 3[BC] + g” + ge" + gs" 
3Ro = 3[CD.] — 91 + ge + 93; ЗЕ, = 3[CD4] + 91 + 92 — 935 
ЗВ, = 3[CD.] + 91 — 92 + gs 
3Ro! = 3[BD.] + gi’ — 9! + 93'; ЗЕ = 3[BD4] — Ф + 92 +з; 
3R»' = [ВБ] +: + go’ — gs" 
Зо" = 3[BCa] +91" +92" — 93"; ЗВ" = [ВС] +91” — ga" +08": 
ЗЕ” = 3[BC4] — gi” + ge" + ga" 
The sum of squares for the two-factor interactions are as follows. 
_ BQ? | BD 39. _ BQ")? 


CD A Е BC 
576. 576 576 


For ACD, divide the sum of squares of deviations of the quantities 3Ro, 
ЗВ, ЗВ by 120, with similar rules for ABD and АВС. 

The CD, BD, and BC two-way tables must be adjusted for block ef- 
fects. For the means per unit in the BC table, the adjustment is 


3Q" [BC] 
192 72 


The adjustment is added (algebraically) when b and c are both present 
or both absent; when b or c alone is present the adjustment is subtracted. 

4? factorial, blocks of 4 units. In plan 6.12 each replication confounds 
completely a set of three of the 9 d.f. for АВ. Since the relative informa- 


226 CONFOUNDING 6.33 


tion on AB is 2/3, this plan gives less precise estimates of the interactions 
than randomized blocks unless the reduction in block size brings a sub- 
stantial decrease in the error m.s. The balanced design, which requires a 
multiple of 3 replicates, is preferable. 

With 3 replicates the analysis of variance is as follows. 


d.f. 

Replicates 2 
Blocks within replicates 9 
Main effects 6 
Two-factor interactions 9 
Error 21 
Total 47 


To compute the sum of squares for AB, let Bmn be the total of the nth 
block in the mth replicate, and let Tmn be the total of all treatments that 
appear in this block (this will be a total over 12 units). Let Pmn = 
Tmn — Bmn, 30 that there are 12 quantities Р,,. Then 


АВ = $ (Ри? + Pi? +++++ Poa”) — 35 (6? + №? + Ви? + Ёш?) 
where @ is the grand total and the R’s are replication totals. 
The estimated block mean (adjusted for treatment effects) is 


b. = 3B, n Tmn 
mn 8 


To present a two-way table, the adjusted mean of treatment (00) is, for 
example, 


(00)’ = (00) — (ть + Виа + bro) 


since this treatment occurs in blocks Ib, Па, and IIIb. 

4 X 2 factorial, blocks of 8 units. In plan 6.13, one of the three com- 
ponents of ABC is confounded in each replicate. In other words, the dif- 
ference between the treatments in block Ia and those in block Ib is one 
component of ABC. The plan gives 2/3 relative information on ABC. 
Any number of replicates шау be used, though three are needed for 
balanced confounding. 

To obtain the sum of squares for ABC, calculate the total for each 
component separately, taking it only from those replicates in which it is 
not confounded. The divisor for the square of this total is obtained by 
the usual rules. Thus with 2 replicates the first component is taken from 
replicate П and has divisor 16, the second component is taken from rep- 
licate I and has divisor 16, while the third component comes from both 
replicates and has divisor 32. 
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If there are r replicates, the adjusted block mean is 
TBmn — Tmn 
рее ERE 
8(r — 1) 


where Ви» is the block total and Tmn that of all treatments appearing in 
the block. With 2 replicates, the adjusted mean of treatment 111 is 


(іу = а) = м. Ұш» 


since this treatment is found in blocks Ia and ІТ. 
4 X 8 X 2 factorial, blocks of 12 units. The 3 d.f. for the factor A at 
4 levels may be divided into the following components. 


А’ = аз + а — a, — а 
A" = аз — а — a + а 
Al” = ag — а + а — а 


In the first 3 replicates of plan 6.14, the interactions A'C and A'BC 
are partially confounded, with 8/9 and 5/9 relative information, re- 
spectively, the interactions of A" and А”” being unconfounded; АС and 
A" BC are partially confounded in replicates 4 to 6 and A'"C and A” BC 
in replicates 7 to 9. Although nine replicates are required for complete 
balance, the design may be used with three or six replicates. In these 
cases the last three or the last six replications should be taken from plan 
6.14. The AC interaction suffers only a trivial loss of replication from 
the confounding. The statistical analysis is described in reference (6.6). 


6.34 Missing Values. When factorial experiments are arranged in 
randomized blocks or latin squares with no confounding, the formulae 
previously given (sections 4.25, 4.36) are used for the estimation of miss- 
ing values. With confounding, the formulae are changed, becoming in 
general more complicated. The different cases are discussed below. 


i. Complete confounding of the 23, 24, 25, 29, 3°, 8% factorials. When 
the effects that are confounded are unimportant, we recommended that 
the first replication of the plan be used for all replications of the experi- 
ment, with complete confounding of these effects. In this case the esti- 
mation formula is simple. Let B be the total yield of all other units in 
the same block as the missing unit, and T be the total yield of all other 
units which have the same treatment as the missing unit. Every replica- 
tion contains a block which has the same set of treatments as the block 
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with the missing value. Let B’ be the total yield of all such blocks (in- 
cluding the block with the missing value). 
The value for the missing unit is estimated by the formula: 


7B + ЕТ — В’ 
KT ЕЕ) 


where k is the number of units per block, апа r is the number of replica- 
tions. 

It will be realized that this is the standard formula for randomized 
blocks (section 4.25), applied only to those blocks which contain the 
treatment with the missing value. 


ii. Partial confounding of the 2°, 2*, 2°, 2°, 33, 34, Д2, 4 X 2? factorials. 
The formula given later is valid when a different replication of the plan 
is used for each replication of the experiment. A further essential con- 
dition is that no treatment comparison be confounded in more than one 
replication. This condition is satisfied in all plans for the designs above, 
except possibly plan 6.4 (2* in blocks of 4) and plan 6.6 (2° in blocks of 
8). For instance, if the first 2 replications of plan 6.6 are used, no inter- 
action is confounded more than once; but if the first 3 replications are 
used, BCDE is confounded twice and the formula does not apply. 

The formula requires some preliminary calculations, most of which 
can be used in the subsequent analysis of variance. In addition to B and 
Т, as defined in case i, we require first the total R of all other units in the 
same replicate as the missing unit and the grand total G of all other units 
in the experiment. In the other replicates, calculate the total S; of all 
blocks which contain the treatment with the missing unit. 

Next form the table of treatment totals. From this table compute 
the sum U of the observations for all other treatments which appear in 
the block which has the missing unit. In the other replications, calculate 
the corresponding sum for each block which contains the treatment with 
the missing unit. Let the total of these (r — 1) sums be V, where r is 
the number of replicates. Note that in the calculation of U and V the 
treatment with the missing value is not included. 

The missing yield is estimated by the equation 


id kt(r — 1)T + tr(r — 1)B + EG + tV — krR — ur — ПО — #5 
C = Dir — 1) — 1) — ( — Ю] 
The table on p. 229 which shows the number of units whose observa- 
tions are added to obtain the various totals T, В, ---, may be helpful in 


checking that the totals have been correctly identified. 
Consider U, for example. There are (k — 1) other treatments in the 
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block with the missing value, and each treatment total is the sum of r 
observations. Consequently U is a total of r(k — 1) individual observa- 
tions. 


Total T B G V Е U So 


Numberof 
units (r — 1) | (k — 1) | (tr — 1) | r(r — 1)(k — 1) | (t — 1) | r(k — 1) | ke — 1) 


iii. Partial confounding of the 8 X 27, 8 X 2, 3 X 2, 4X 3 X 2 fac- 
torials. For these designs the general method given by Yates (6.16) 
should be followed. 

iv. Single replication of the 2°, 29, 33, 34 factorials. A missing value is 
estimated by minimizing the sum of squares for the interactions that are 
used as error. 

The estimation of missing values in cases iii and iv requires a knowl- 
edge of the algebra of the analysis of variance. The expressions for the 
missing value are usually fairly simple in case iv but are complicated in 
case 11. 

When the estimated value has been substituted, the analysis of vari- 
ance is calculated by the appropriate method for the design. As usual, 
the error d.f. are reduced by one for each estimated value. Variance- 
ratio tests of the treatment effects are slightly disturbed. For t-tests, ап 
approximate rule is to decrease the number of replicates ascribed to any 
mean by 114 for each estimated value that the mean contains. Suppose, 
for instance, that (abc) is missing in a 2° experiment with 3 replications. 
The mean of the units which receive a is taken over 12 observations, one 
of which is an estimated value. By the rule, the standard error of this 
mean is computed as s/V 101 instead of 3/V 12. Consequently the 
standard error of the mean response to A is taken as 


‘cw 
et (pci rs 
10} 12 


252 


12 


instead of 


6.35 Estimation of the Gain in Precision from Confounding. Аз men- 
tioned previously, the advisability of confounding depends on the 
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amount by which the experimental error is decreased when the block size 
isreduced. It is therefore worth while, especially where confounding has 
not been previously used, to estimate the gain in precision which has been 
obtained in each experiment. 

The technique is essentially the same for all types of design. From the 
results of the analysis of variance of a confounded experiment, an esti- 
mate E, is made of the experimental error which would have been 
present if the experiment had been laid out in randomized complete 
blocks. 

Single replication. Let Еу be the mean square for blocks, and Е. the 
mean square for error. If {Неге is only 1 replication in the experiment, 
the estimate Æ, is 
_ mE, + nl 

Ny + Пе 
where n; = number of degrees of freedom for blocks. 
n, = total number of degrees of freedom minus ny. 


E, (6.1) 


For example, with a 2° experiment in blocks of 8 units (single replication) 
ль = 7, п. = 63 —7 = 56 


The estimate E, is directly comparable with the estimated error Ee. As 
shown in section 2.31, the comparison may be expressed in terms of the 
relative amounts of replication required to obtain equal accuracy with 
the two types of layout. 

The estimate Е, is subject to the assumption that the interactions 
which are confounded with blocks are negligible. Since confounding is 
complete, no method of estimation avoids this assumption. If the con- 
founded interactions are not negligible, the calculation overestimates the 
gain from confounding. 


More than one replication. If the confounding is complete, formula 
6.1 holds for Е,, subject again to the assumption that the confounded 
comparisons are negligible. In this case 


Еь = mean square for blocks within replications. 

E, = error mean square. 

пь = number of degrees of freedom for blocks within replications. 

пе = total number of degrees of freedom minus number of degrees of 
freedom for replications minus np. 


Thus with 2 replicates of a 2° design in blocks of 8 units, the same inter- 
actions being confounded in both replicates, we have 


пь = 14, п. = 127 — 1—14 = 112 
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With partial confounding, the same formula may be used if all con- 
founded comparisons are assumed to be small. Evidence on the validity 
of this assumption is provided by the experimental results, since partially 
confounded comparisons can be tested for significance. 

It is possible to avoid the assumption that confounded comparisons are 
negligible, though some extra computations are necessary. We first 
adjust the mean square among blocks within replicates so as to remove 
treatment effects. The following identity holds for the sums of squares. 


Blocks within replicates (adj.) + treatments (unadj.) = 
blocks within replicates (unadj.) + treatments (adj.) 


The two quantities on the right of the equation appear in the analysis 
of variance of the experimental results. Consequently the adjusted 
blocks s.s. is found from this equation by calculating the unadjusted 
treatments в.в. From the adjusted blocks s.s. we obtain, as shown 
below, a new estimate Ep’ for use in formula (6.1). 

These procedures may be illustrated from the results of the 3 X 3 X 2 
experiment in 4 replicates (section 6.19), in which AB and ABC are 
partially confounded. From table 6.15 the following summar” of the 
analysis of variance is obtained. 


d.f. 8.8. m.s. 

Replications 3 3,837 1,279 

Blocks within replications 8 2,836 354 

Treatments 17 7,802 459 

Error 43 8,909 207 
Total 7" 23,384 


If AB and ABC are assumed negligible, the estimate E, is found by 
formula (6.1) for the case where there is more than one replication. 
8 X 354 + 60 х 207 
Mars umeros. ded 
68 
In order to avoid the assumption, we caleulate the unadjusted treat- 
ments s.s., which is found to be 8328. Since the adjusted treatments 
5.3. is 7802, we have 
Blocks s.s. (adj.) = 2836 + 7802 — 8328 = 2310 
corresponding to a mean square Еь, of 289. 
An unbiased estimate Еу of the variance between blocks within rep- 


licates is А 
, _ 4Еь, — Е. 4 X 289 — 207 


р 3 


т 


= 316 
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This value is inserted in formula (6.1) to give E": 
; аа ВИ lee are 
68 

The estimated gain in efficiency over randomized blocks is 8% when 
AB and ABC are assumed negligible and 6% without this assumption. 
The two results agree closely, as is usually the case unless the confounded 
interactions are large. 

The method above applies to all designs with partial confounding. 
For the designs discussed in this chapter, the formula for Æ,’ in terms of 
E,, and Е, is 


, 


TE, — E, 
йыл cadit 
r— 1 


where т is the number of replicates. 
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PLANS 
Plan 6.1 2° factorial, blocks of 4 units 
Rep. I, ABC confounded 


Plan 6.2 2 factorial, blocks of 8 units 
Rep. I, ABCD confounded 


= 
= 


1 


ІЗГІ 
RERTES 


bed abcd 


Plan 6.3 28 factorial, blocks of 16 units 
Rep. I, ABCD, ABEF, CDEF confounded 


Plan 6.4 


PLANS 


Balanced group of sets for 24 factorial, blocks of 4 units 


Two-factor interactions are confounded in 1 replication and three-factor 
interactions are confounded in 3 replications. The columns are the blocks. 


Rep. I, AB, ACD, BCD confounded 


abd a 


Plan 6.5 


Rep. II, AC, ABD, BCD 


Balanced group of sets for 25 factorial, blocks of 8 units 


Three- and four-factor interactions are confounded in 1 replication. 


Rep. I, ABC, ADE, BCDE confounded 


(D ab 
bc ас 
abd d 
acd bed 
abe е 
ace bce 
de аме 


a b 
абс с 
bd ad 
be abed 
ce ae 
ade ^ abce 
abcde bde 
са cde 


() ab 
ad bd 
abe с 
bed асі 
abe е 
bde ade 
ce abce 
acde bede 


Rep. П, ABD, BCE, ACDE 


а 
greene go 
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Plan 6.5 (Continued) : 
Balanced group of sets for 2° factorial, blocks of 8 units 


Rep. III, ACE, BCD, ABDE Rep. IV, ACD, BDE, ABCE 
(D^ "ac^ «a c (1) ad а d 
ae се е асе ac с с acd 
abc b bc ab abd b bd ab 
bce abe abe be bed abc аһа be 
«а а са аа ade е de ae 
cde ade acde de cde ace acde ce 
bd abd abd bed be abde abe bde 
abde bede bde abcde abce bede bce abcde 


Rep. V, ABE, CDE, ABCD 


(1) ae a е 
ab һе b abe 
ace c ce ас 


bee ас abe bc 
ade d de аа 
Ме abd аме bd 

cd асйе acd сае 
abcd bede bcd abcde 


Plan 6.6 Balanced group of sets for 2° factorial, blocks of 8 units 
All three- and four-factor interactions are confounded in 2 replications. 
Rep. I, ABC, CDE, ADF, BEF, ABDE, BCDF, ACEF confounded 


abe a b (1) be ac с ab 
bd са abed acd abd 4 ad bed 
ae abce се bee е abe be ace 


cde Ме ade аһіе acde bede abcde de 
of bf af ау ау bf abf f 
adf ооу cdf Ый df ау һај  acdf 
bef сј  abcef ав) abf ef ај beef 
abcdef adef bdef def — bcdef acdef cdef — abdef 


Rep. II, ABD, DEF, BCF, ACE, ABEF, ACDF, BCDE 


add b a 0) ad ы а ab 
cd . ac. be ас bd acd abd c 
be abde de ade e MS ш Is 
ace cde abcde bede аке се bee acde 
af af abdf Һај abf f of adf 
ЫЈ abedf cdf ай cf ау ај оу 
def ај bef аһеј bdef adef abdef ef 
abcdef beef асеј сеј — acdef bedef сіе — abcef 
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Plan 6.6 (Continued) 
Balanced group of sets for 25 factorial, blocks of 8 units 


Rep. III, ABE, BDF, ACD, CEF, ADEF, BCDE, ABCF 


bc a ac (1) abe ab b c 
acd bd bed abd cd d ad abcd 
abe ce е асе be bce ае ae 

de abcde abde  bcde ade acde cde bde 
af bef bf abf f сў acf abf 
bdf асај ай cdf  abdf abedf bedf df 
cef аж abcef bef асе} aef ef һеј 
abcdef def саў adef bcdef bdef abdef acdef 


Rep. IV, ABF, CDF, ADE, BCE, ABCD, BDEF, ACEF 


ac a b 8) e abc be ab 
bd bed acd abd abd а ad са 
bee be ae abe abe се ace е 
ade acde де cde de аме bde abcde 
abf ау cf Ыы Я Шил acf 
cdf df abdf adf acdf bedf аһсај һау 
ef cef аме) асеў aef bef abef beef 
abcdef abdef def ^ bdef bedef acdef cdef adef 


Rep. V, ACF, BCD, ADE, BEF, ABDF, CDEF, ABCE 


ab a bc а) b ас с ас 


acf abf f b cf abf bf af 

df bdf ас}  abdf adf Һау abcdf cdf 
bef ef abcef aef  abef сеј асеў beef 
abcdef acdef bdef cdef bedef adef def ^ abdef 


Rep. VI, ABC, BDE, ADF, CEF, ACDE, BCDF, ABEF 
Interchange B and C in replication I 

Rep. УП, АВЕ, DEF, BCD, ACE, ABDE, ACDF, BCEF 
Interchange F and D in replication II 

Rep. VIII, ABE, BDF, CDE, ACF, ADEF, ABCD, BCEF 
Interchange А and E in replication III 

Rep. IX, ABD, CDF, AEF, BCE, ABCF, BDEF, ACDE 
Interchange F and D in replication IV 

Rep. X, АЕР, BDE, ACD, ВСЕ, ABDF, CDEF, ABCE 
Interchange E and C in replication V 


238 
Plan 6.7 


CONFOUNDING 
Balanced group of sets for 3° factorial, blocks of 9 units 


ABC confounded 
Rep. I, ABC(W) * Rep. II, ABC(X) 
a p Wc a b c 
000 100 200 000 100 200 
110 210 010 110 210 010 
220 020 120 220 020 120 
101 201 001 201 001 101 
211 011 11 011 111 211 
021 121 221 121 221 021 
202 002 102 102 202 002 
012 112 212 212 012 112 
122 222 022 022 122 222 


а 


* Yates’ notation. 
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b 


€ 


Rep. IV, ABC(Z) 


b 


100 200 
010 110 


120 220 020 


001 101 


111 211 Oll 


121 221 


102 202 002 


112 212 
222 022 122 


Balanced group of seta for 3* factorial, blocks of 9 units 


0000 
1011 
2022 
0121 
1102 
2110 
0212 
1220 
2201 


0011 
1022 
2000 
0102 
1110 
2121 
0220 
1201 
2212 


Three-factor interactions confounded 


Rep. I, ABC I, ABD III, ACD IV, BCD II confounded 


0021 
1002 
2010 
0112 
1120 
2101 
0200 
1211 
2222 


0002 
1010 
2021 
0120 


Plan 6.8 (Continued) 
Balanced group of sets for 3 factorial, blocks of 9 units 


П, ABC II, ABD IV, ACD I, BCD Ш 


Rep. 


PLANS 


0022 


0011 
1000 
2022 
0120 
1112 
2101 
0202 
1221 
2210 


0010 
1002 


0002 
1021 


1212 
2201 


0001 
1020 
2012 


Rep. ІП, ABC IV, ABDI, ACD II, BCD I 


0000 
1021 
2012 
0122 
1110 
2101 
0211 
1202 


0021 
1012 
2000 
0110 
1101 
2122 
0202 
1220 
2211 


0012 


0002 
1020 
2011 
0121 
1112 
2100 
0210 
1201 
2222 


0020 
1011 
2002 
0112 
1100 
2121 
0201 
1222 
2210 


0012 
1021 
2000 
0120 
1102 
2111 
0201 
1210 
2222 


0021 
1000 
2012 
0102 
11 
2120 
0210 
1222 
2201 


0011 
1020 
2002 
0122 
1101 
2110 
0200 
1212 
2221 


Rep. IV, ABC Ш, ABD II, ACD Ш, BCD IV 
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Plan 6.9 Balanced group of sets for 3 X 2° factorial, blocks of 6 units 
BC, ABC confounded 


Rep. I Rep. П Rep. III 
а,” а b ас, Б 
001 000 000 001 000 001 
010 011 011 010 011 010 
100 101 101 100 100 101 
11 110 110 11 111 110 
200 201 200 201 201 200 


211 20 211 210 210 21 


Plan 6.10 Balanced group of sets for 3 X 2? factorial, blocks of 6 units 


BC, BD, CD 
ABC, ABD, ACD confounded 


Rep. I 
a b с а 


0100 0000 0001 0010 
0011 0111 0110 0101 
1010 1001 1000 1100 
1101 1110 1111 1011 
2001 2010 2100 2000 
2110 2101 2011 2111 


Plan 6.11 Balanced group of sets for 3? X 2 factorial, blocks of 6 units 


PLANS 


AB, ABC confounded 


Rep. П 


Plan 6.12 Balanced group of sets for 4? factorial, blocks of 4 units 
AB confounded 
Rep. I Rep. П Rep. Ш 
а b с d a,b cd a b e d 
33 32 31 30 33 30 32 31 33 31 32 30 
22 23 20 21 21 22 20 23 20 22 21 23 
10 11 12 13 12 11 13 10 11 13 10 12 
01 00 03 02 00 03 01 02 02 00 03 01 
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Plan 6.18 Balanced group of sets for 4 X 2? factorial, blocks of 8 units 


ABC confounded 

Rep. I Rep. П Rep. III 

а: zh a b ад. Ы 
000 001 000 001 000 001 
011 010 011 010 011 010 
100 101 101 100 101 100 
11 110 10 11 110 111 
201 200 201 200 200 201 
210 211 210 211 211 210 
301 300 300 301 301 300 
310 311 311 310 310 311 


Plan 6.14 
Balanced group of sets for 4 X 3 X 2 factorial, blocks of 12 units 


A'C, A'BC confounded 


Rep.I Rep. И Rep. III 
а b a b a b 


000 001 001 000 001 000 
011 010 010 011 011 010 


Plan 6.14 (Continued) 


PLANS 
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Balanced group of sets for 4 X 3 X 2 factorial, blocks of 12 units 


A"C, A" BC 
Rep. IV Rep. V Rep. VI 
a b а і a b 
001 000 000 001 000 001 
010 Oll 011 010 010 011 
020 021 020 021 021 020 
100 101 101 100 101 100 
111 110 110 11 111 110 
121 120 121 120 120 121 
200 201 201 200 201 200 
211 210 210 21 211 210 
221 220 221 22% 220 221 
301 300 300 301 300 301 
310 311 311 310 310 311 
320 321 320 321 321 320 
AMC, A"BC 
Rep. VII Rep. VIII Rep. IX 
a b a b a b 

000 001 001 000 001 000 
011 010 010 Oll 011 010 
021 020 021 020 020 021 
101 100 100 101 100 101 
110 111 111 110 110 11 
120 121 120 121 121 120 
200 201 201 200 201 200 
211 210 210 211 211 210 
221 220 221 220 220 221 
301 300 300 301 300 301 
810 811 811 30 310 811 
320 821 320 821 821 820 


A’ =a3+a2—a,—a9 
А" =a3—a2—a\+a9 
Al” 2 a3—a5--a1 — а) 


CHAPTER "6 А 


Factorial Experiments 
in Fractional Replication 


6A.1 Construction and Properties of Fractionally Replicated Designs 


6A.11 Introduction. Often, even a single replication of a factorial ex- 
periment is beyond the resources of the investigator, or it gives more 
precision in the estimates of the main effects than is needed. In a single 
replication of a 2° factorial, each main effect is an average over 32 com- 
binations of the other factors, and hence in effect has 32-fold replica- 
tion. Perhaps 4- or 8-fold replication would suffice. In situations like 
this, an experiment which consists of only part of a complete replication 
is worth considering. 

The use of experiments in fractional replication was proposed in 1945 
by Finney (6А.1). He outlined methods of construction for 2” and 3” 
factorials, and described a half-replicate of a 4 x 2* agricultural field 
experiment that had been conducted in 1942. In 1946, Plackett and 
Burman (6А.2, 6A,3) gave designs for the minimum possible size of 
experiment with p” factorials (p = 2, 3, ... 7) and pointed out their 
utility in physical and industrial research. Since then, the designs have 
found many applications, partieularly in industrial research and de- 
velopment. Their chief appeal is that they enable 5 or more factors to 
be included simultaneously in an experiment of a practicable size, so 
that the investigator can discover quickly which factors have an impor- 
tant effect on the product. ў 

This welcome reduction in the size of the experiment is not obtained 
without paying a price. As will be seen, the results of an experiment in 
fractional replication are open to misinterpretation in a way that does 
not arise with fully replicated designs. Before using these designs, it is 
important to understand the uature of this confusion. 
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6A.12 Aliases. In order to see what happens when the experiment 
contains part of a replication, consider a 2? factorial in which only the 
4 treatment combinations a, b, c, and abc are tested. This is half of the 
complete replicate. From the experimental results the main effects and 
interactions would be estimated as follows (apart from a divisor 2). 


FACTORIAL EFFECTS 


"Treatment 

combination А B с АВ Ас ВС АВС 
а + ті T = em яғ + 
b pa + v = zi = > 
с те = te Tr = = + 
abc + “Р an + + Р F 


For the main effect of A, we add the yields for units that contain a 
and subtract the yields for units that do not, i.e., 


А = (abe) + (а) - (b) - (с) 


and similarly for the main effects of B and C. It will be seen from the 
table that the A, B, and С comparisons are mutually orthogonal. Thus 
the estimates of the three main effects are independent. 

To compute the AB interaction we will follow the “Evens versus 
odds” rule (section 5.24). Add the yields for units that contain an even 
number of the letters a, b, and subtract the yields for units that contain 
an odd number. This gives 


АВ = (abe) + (с) — (a) — (b) 


But this is the same quantity that is used to estimate the main effect 
of C. The name aliases was given by Finney (6A.1) to two factorial 
effects that are represented by the same comparison. Thus C and AB 
are aliases. We write С = AB. The table above also shows that 


АС = В: ВС = А 
The three-factor interaction, ABC, cannot be estimated at all. If 


all 8 treatment combinations were available, ABC would be computed 
from 

ABC = (abe) + (a) + (b) + © — (ab) — (ae) - (be) - (1) 
We have, however, chosen the 4 treatment combinations which carry a 
+ sign in this expression to be our half replicate, so that no comparison 
is possible, ABC is called the defining contrast, since it was the contrast 


used to split the factorial into 2 half-replicates. 
In summary, the outcome of using a half-replicate is to lose one facto- 
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rial effect, ABC, entirely and to leave each main effect inextricably mixed 
with one of the two-factor interactions. 

If the experiment shows an apparent effect of A, there is no way of 
knowing from the results whether the effect is really due to A, to the 
BC interaction, or to a mixture of the two. This kind of confusion is 
always present in experiments with fractional replication, since every 
factorial effect always has one or more aliases. The problem of deciding 
to which alias an effect is to be attributed is one that the experimenter 
must face when he comes to interpret the results. 

In some types of research, previous experience with the same factors, 
or a knowledge of the nature of their actions, may lead the investigator 
to predict confidently that their interactions will be negligible. The 
ambiguity then disappears: we conclude that an apparent effect of A 
is in fact due to A, and likewise for B and C. With these assumptions, 
the half-replicate provides independent estimates of the three main 
effects. 

In other cases, the nature of the results makes one interpretation of 
the data more plausible than another. Suppose that the comparison A 
shows a large effect, but B and C have no apparent effects. It could be 
argued, at least in some types of experimentation, that factors B and C 
are unlikely to have a large interaction, since their main effects are 
apparently absent. The large effect of A would therefore be ascribed 
to A and not be BC. This conclusion could be wrong: some risk of 
misinterpretation cannot be avoided. 

On the other hand, the experiment might show large effects of the 
B and C comparisons and a much smaller effect of the A comparison. 
Since A = BC, we would have to consider the possibility that the ap- 
parent A effect was wholly or partly due to an interaction between the 
large effects of В and С. If an experiment takes only a short time to 
complete, this uncertainty can be resolved by running a new experiment 
consisting of the four combinations, (ab), (ас), (bc), and (1), that were 
omitted in the first experiment. By putting the results of the two ex- 
periments together we have a complete replicate, which gives independ- 
ent estimates of A and BC. This device is, of course, less practicable 
in lines of work where an experiment takes a long time to produce its 
results. It follows that more risks can be taken with fractional replica- 
tion when experiments are short than when they are long. 

In order to use fractional replication with the minimum risk of mis- 
interpretation, it is important to know what are the aliases of the fac- 
torial effects in which we are interested. The alias of any factorial 


effect can be found by a simple rule, which for the moment we will state 
for half-replicates. 
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In the 2" system, the alias of any factorial effect is its generalized inter- 
action with the defining contrast. 

To illustrate, the defining contrast in this example is ABC. The alias 
of A is therefore the “interaction” AABC, ог A?BC. In interpreting a 
generalized interaction, any squared terms are canceled (as mentioned 
previously in section 6.13). Thus A?BC is read as BC. Similarly, the 
alias of B is AB?C or AC. 

Does it make any difference which half-replicate is chosen? This 
choice affects the alias relationships in a way that is worth noting. Sup- 
pose that instead of (abc), (a), (b), and (c) we take the other half: (ab), 
(ac), (bc), and (1). The main effect of A (apart from the divisor) is 


А = (ab) + (ac) - (be) - (1) 


In computing the BC interaction, we give a + sign to combinations 
having both or none of the letters b and с; i.e., 


BC = (be) + (1) — (ab) — (ac) 


Note that А is now equal to the BC interaction with its sign changed: we 
write А = — ВС. With the original half-replicate, on the other hand, А 
and BC are identical; 1.е., А = 4- BC. 

То pursue the issue further, suppose that B and C are factors whose 
interaction is positive, in the sense that the combination (bc) gives à 
higher response than would be expected from the sum of the responses 
to (b) and (c) alone. With the half-replicate (ab), (ac), (bc), and (1), the 
presence of this interaction will make the observed effect of A smaller 
(algebraically) than the true effect of A. With the original half-repli- 
cate, on the other hand, the true effect of A is overestimated if a positive 
BC interaction exists. 

This issue should seldom be of practical importance, because it is not 
advisable to conduct an experiment in which A and BC are aliases if 
both effects may be of the same order of magnitude. The distinction 
between the + and — signs in an interaction alias is worth remember- 
ing, however, when we are trying to make sense of an experiment whose 
results are different from our anticipations. 

In the next two sections we discuss a half-replicate of the 2° factorial 
and а half- and quarter-replieate of the 2° factorial, in order to become 
familiar with the properties of fractional replications in larger experi- 
ments. 


64.13 A Half-replicate of a 2° Factorial. The experiment is to con- 
sist of 16 of the 32 possible treatment combinations. Since all informa- 
tion is lost on the factorial effect that is used to divide the replication 
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into halves, the obvious choice for the defining contrast is the five-factor 
interaction ABCDE. Having made this choice, we can write down the 
alias of any other factorial effect by the rule just given. These aliases 
can be examined to see if they are suitable for our purpose before going 
to the trouble of writing down the plan itself. The alias of A, for in- 
stance, is A?BCDE, i.e., the four-factor interaction BCDE. Тһе com- 
plete set of aliases appears in table 6A.1. 


TABLE 6АЛ ALIASES IN А HALF-REPLICATE OF А 2° FACTORIAL 


Defining contrast: ABCDE 


Main Two-factor 
effects Alias interactions Alias 
A BCDE AB CDE 
B ACDE AC BDE 
с ABDE AD BCE 
D ABCE AE BCD 
E ABCD BC ADE 
BD ACE 
BE ACD 
CD ABE 
CE ABD 
DE ABC 


Every main effect has a four-factor interaction as alias, and every 
two-factor interaction has a three-factor interaction as alias. No main 
effect is mixed up with a two-factor interaction as oceurred in the 2° 
design. In practice, this design can be used in at least three different 
ways. 

(i) If all interactions among 3 or more factors can be assumed negli- 
gible, the design gives independent estimates of all main effects and all 
two-factor interactions. Its chief defect is that it does not furnish any 
d.f. for the estimation of error. Since there are 16 observations in the 
whole experiment, the total sum of squares has 15 d.f., and of these, 5 
are allotted to main effects and the remaining 10 to the two-factor in- 
teractions, If estimates of the error variance have been made in previ- 
ous experiments, and these have shown stability from one experiment 
to another, they might be pooled to provide rough standard errors and 
tests of significance, 

(ii) If all two-factor interactions are considered likely to be absent or 
small, the experiment can be used to estimate only the 5 main effects. 
The 10 d.f. attributed to the two-factor interactions provide the estimate 
of error. 


Gii) If only a few of the two-factor interactions appear likely to be 


64.14 HALF- AND QUARTER-REPLICATES OF А 2? EXPERIMENT 249 


substantial, say АВ, АС, and BC, the scheme of analysis may be to 
estimate all main effects plus these 3 interactions. An estimate of error 
is made from the remaining 7 d.f. 

Тп situations (ii) and (ili), the experimenter should avoid any tempta- 
tion to pick out for estimation those interactions that appear to be large 
and to use the remainder to estimate the error variance. This practice 
can produce a serious underestimation of the true error variance. 

To write down the plan, we take cither the 16 treatment combinations 
that contain 5, 3, or 1 of the letters a, b, с, а, е, or the 16 that contain 
4, 2, or none of these letters. 


TABLE 6А.2 Tue TWO IALF-REPLICATES OF A 2° FACTORIAL 


Defining contrast: ABCDE 


+ABCDE —ABCDE 

abcde abcd 
abc abce 
abd abde 
abe acde 
acd bede 
ace ab 
ade ac 
bed ad 
bee ae 
bde be 
cde bd 

a be 

b cd 

c ce 

d de 

e (1) 


The rule for finding aliases is easily extended to give the sign of the 
alias. One of the half-replicates contains the treatment combination in 
which all the letters appear (i.e., abede in the 2° factorial). If this half- 
replicate is selected, call the defining contrast +. If the other half is 
chosen, the defining contrast is given a — sign. In finding aliases, the 
signs are combined by the usual laws of algebra. The interaction of A 
with —ABCDE is —BCDE: the interaction of —BC with —CDE is 
+BDE. 


64.14 Half- and Quarter-replicates of a 2° Experiment. The half- 
replicate, which requires 32 units, will be discussed only briefly, since 
the principles are the same as in the 2° factorial. The best defining con- 
trast is ABCDEF. Every main effect then has a five-factor interaction 
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as alias (e.g., А = BCDEF), and every two-factor interaction has a 
four-factor interaction as alias (e.g., AB = CDEF). There are 20 three- 
factor interactions, which divide themselves into 10 pairs of aliases (e.g., 
ABC = DEF). Consequently, in order to obtain valid estimates of all 
main effects and two-factor interactions, we need only assume that four- 
and five-factor interactions can be neglected. If the three-factor inter- 
actions are also negligible, the 10 d.f. for the 10 alias pairs can be used 
as an estimate of error. In this scheme the partition of d.f. is as follows. 


df. 

Main effects 6 

Two-factor interactions 15 
Error (from three-factor 

interactions) 10 

Total 31 


If the half-replicate corresponding to + ABCDEF is selected, the plan 
consists of all treatment combinations that contain 6, 4, 2, or none of 
the letters abcdef (table 6A.3). 


TABLE 6A.3 A HALF-REPLICATE OF A 2° FACTORIAL 


abcdef ама аже аһ] аме abdf abef acde 
асај асеј adef bede bedf beef bdef cdef 
ab ac ad ae af с мм be 
bf cd е с de df e (9) 


To obtain a quarter-replicate, another factorial effect is used as a 
defining contrast to divide the 32 combinations in table 64.3 into 2 sets 
of 16. Since this factorial effect will not be estimable, it is natural to 
try a five-factor interaction, such as ABCDE. Taking +ABCDE, we 
select from the 32 combinations the 16 that contain 5, 3, or 1 of the let- 
ters abede. It may be verified that we obtain the following 16 combina- 
tions: 

abcdef abef аһа} abef асау асеў adef bedf 
beef  bdef cdef af bf d а ef 


These are the 16 combinations that have a + sign in the expression for 
ABCDEF and also in the expression for ABCDE. Note, however, that 
all 16 combinations contain the letter f. We have inadvertently pro- 
duced a design from which the main effect of F cannot be estimated, 
there being no tests made at the lower level of F. In other words, F 
has become a defining contrast: if F had been used instead of ABCDE 
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to construct the quarter-replicate, the same set of 16 treatments would 
have been obtained. 

This example is an illustration of a general rule. 

In the 2" system, any two factorial effects may be used as defining con- 
trasts to divide а factorial experiment into quarter-replicates. Their gen- 
eralized interaction also acts as a defining contrast, and cannot be estimated 
from the quarter-replicate. 

Тһе generalized interaction of ABCDEF and ABCDE is A?E*C?D?E?F, 
or F. This rule is reminiscent of the rule that applies (section 6.13) 
when a replicate of a factorial experiment is divided into 4 blocks by 
confounding factorial effects. 

Before trying to improve on this design, we will state the rule govern- 
ing aliases: 

In а quarter-replicate in the 2” system, any factorial effect that is not a 
defining contrast has 3 aliases. These are its generalized interactions with 
the 3 defining contrasts. 

For instance, the aliases of A are its interactions with ABCDEF, 
ABCDE, and F, 1.е., BCDEF, BCDE, and AF. To verify this, note 
that from the 16 treatment combinations that are available, A is esti- 
mated (apart from a divisor) as 


A = (abcdef) + (abef) + (abdf) + (abef) 
+ (acdf) + (acef) + (adef) + (af) — (bedf) — (beef) 
— (bdef) - (сй) — (bf) - (of) — (df) — (ef) 


This is also the expression for АР, since all the + signs contain both 
the letters a and f, while the — signs contain only one of these letters. 
The same check may be applied to BCDEF and BCDE. 

In seeking a better design for the 28 factorial in а quarter-replicate, 
we would like the defining contrasts, and all aliases of main effects and 
of two-factor interactions, to be interactions of high order that can be 
assumed negligible. Trial and error shows that this cannot be achieved. 
The best procedure is to select as defining contrasts 2 four-factor inter- 
actions like ABCE and ABDF that have two letters in common. Their 
generalized interaction, the third defining contrast, is another four-factor 
interaction CDEF. 

With this design it is clear that the alias of any main effect is an inter- 
action among at least three factors. Some of the aliases of two-factor 
interactions, however, are themselves two-factor interactions. If we 
proceed systematically to write down the aliases of each of the 15 two- 
factor interactions, we find that when we have done this for 7 of the two- 
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factor interactions, the remaining 8 have already turned up as aliases. 
The 7 sets are shown in table 6A.4. 


TABLE 6A.4 ALIASES OF TWO-FACTOR INTERACTIONS IN A 2° FACTORIAL 
(М replicate) 


Defining contrasts: ABCE, ABDI, CDEF 


Two-factor 

interaction Aliases 
AB СЕ, DF, ABCDEF 
AC BE, BCDF, ADEF 
AD BF, BDCE, ACEF 
AE BC, ВРЕР, ACDF 
AF BD, BCEF, ACDE 
cD ЕР, ABDE, АВСР 
OF DE, АВЕР, ABCD 


To summarize, this arrangement furnishes valid estimates of the main 
effects, provided that three-factor interactions can be ignored. Two- 
factor interactions are badly mixed up. 

There are several situations in which this design сап be used. Tf all 
two-factor interactions are thought to be relatively small, we estimate 
only the main effects, the remaining comparisons providing an estimate 
of error. The analysis of variance is as follows: 


d.f. 

Main effects 6 
Error (from interactions) 9 
Total 15 


The 9 d.f. for error are composed of the 7 alias sets given above that 


involve two-factor interactions, plus 2 alias sets that contain only three- 
factor interactions, namely: 


ACF = BCD = BEF = ADE. and ACD = AEF = BCF = BDE 
Alternatively, 
provided that th 
interact with any of the other factors or with each other. In the 7 


estimates of AB, AC, AD, BC, BD, and CD, respeetively. The 7th set, 
which contains CF and DE, contributes to the estimate of error. This 
appears to be as far as we can go in disentangling the two-factor inter- 
actions. If this analysis is adopted, the d.f. subdivide as follows: 
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d.f. 
Main effects 6 
Two-factorinteractions of А, B, CL 6 
Error (from remaining interactions) 3 


Total 15 


The d.f. for error are scanty. 

The plan for this quarter-replicate can be written down by using 
ABCE to divide the 64 combinations into halves, and then using ABDF 
to divide the chosen half into quarters. If the selected quarter is 
+ABCE, --ABDF, the following 16 treatment combinations are tested. 


abcdef аба) cdef abce acd ade bed bde 
acf aef. bef bef ab ce df (1) 


This account of fractional factorial designs is intended only as an 
introduction sufficient to enable the plans at the end of this chapter to 
be used with understanding. Further discussions are given in references 
(6A.1), (6A.4), and (6A.5). 


6A.15 Example of the Statistical Analysis. The experiment is a half- 
replicate of a 2° factorial. Factors A, В, С, D, and Е are different 
constituents that go into the commercial manufacture of icing for cakes, 
and factor Ё is a variable that the housewife may introduce in using the 
icing. The defining contrast is ABCDEF, and the half-replicate used is 
the same as that in table 0A.3. The variable to be analyzed is the 
texture of the icing. Table 6A.5 shows the texture readings for the 32 
treatment combinations. The analysis proceeds as in table 6A.5. 

1. Decide which factorial effects are to be isolated and tested, and 
which are to constitute the estimate of error. (If some of the factors 
to be isolated have aliases of the same order, e.g., if CD = EF, it is nec- 
essary to decide also which alias is to be regarded as producing any 
effect that is found.) In the example, all 6 d.f. for main effects and all 
15 d.f. for two-factor interactions have higher-order interactions as 
aliases, and all will be estimated. The remaining 10 d.f. supply the 
error mean square. 

2. There are several ways of computing the sum of squares for main 
effects and two-factor interactions. One is to construct the 15 two-way 
tables for every pair of variables. Each table gives the sum of squares 
for one interaction and two main effects: these sums of squares are caleu- 
lated by the usual rules of the analysis of variance. Another method is 
to write down the combination of signs for each of the 21 effects that are 
to be estimated, as was done in table 6.4, p. 190, for a 2% factorial. 
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6 
TABLE 6A.5 STATISTICAL ANALYSIS OF }4-REPLICATE OF A 2° FACTORIAL 


Treat- 


ment (5) 

combi- Effect 4 А 

nation Texture (1) (2) (3) (4) total Identification 

(1) 233 450 1034 2200 4634 8985 G 

a(f) 217 584 1166 2434 4351 151 А 

қ) 267 483 118484 206 -18 317 В 

ab 317 083 1250 1935 349  —49 АВ 

ef 250 517 983 34 44 649 С 

ас 233 667 1433 —232 —117 Дд АС 

be 333 60 967 16 134 -Я BC 

abe(f) 350 600 968 33 —183 151. ABC (Error) 

d(f) 250 59 34 334 198 -247 р 

ad 207 433 0 100 451 51 Ар 

bd 400 733 -П6 —150 =34 1—51 BD 

abd(f) 267 70 116 33 49 51 АВР (Error) 

cd 400 467 17 10  —134 -515 CD 

acd(f) 250 500 = 34 51 719 ACD (Error) 

bed(f) 283 484 183 -150 302 —383 ВСР (Error) 

abed 87 44 200 —33 —151 -617 EF = ABCD 
8985 9136 9404 10136 8544 

ef) 200  —16 134 132 24 —283 Е 

ае 350 50 200 66 —481 547 AE 

be 288 -17 150 450 -26 —551 BE 

abe(f) 150 17 =50 1 817 —317 ABE (Error) 

ce 400 17—17. —34 -24 253 СЕ 

ace(f) 333 —133 -33 0 183 83 АСЕ (Error) 

bee(f) 87 —150 33 —18 -66 185 ВСЕ (Error) 

abce 383 34 0 67 117 -—453 РЕ = ABCE 

de 207 150 66 66 -66 -75 DE 

ade(f) 200 —133 34 —200 -449 583 ADE (Error) 

bde(f) 150 | —67 —150 84 34 417 BDE (Error) 

abde 350 66 184 —33 85 188 ОР = ABDE 

cde(f) 107 —67 —283 —32 -2966  —383 СРЕ (Error) 

acde 367 20 133 894 “117 51 BF = ACDE 

bede 27 250 207 416 366 149 АЁ = BCDE 


abcde(f) 217 —50  —300 -567 —983 —1349 Г = ABCDE 


Odds 4417 4484 4336 5864 5072 
Evens 4568 4702 5068 4272 3472 


8985 9136 9404 10136 8544 694 


A third method, to be illustrated here, is an adaptation of Yates’ 
automatic method (section 5.24a) of finding all factorial effect totals in 
a 2" factorial. This is the quickest method for the larger fractional 
factorial experiments, The adaptation proceeds аз follows. 
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Since the experiment has 32 treatment combinations, it contains a 
complete replication of the first 5 factors if F is ignored. Arrange these 
5 factors in the systematic order required for the application of Yates’ 
method, as shown in table 6A.5. In this table the treatment symbol f 
has been placed in ( ) as a reminder that f plays no part in the system- 
atic order. 

Next apply Yates’ method as if the experiment were a complete 
replicate of a 2° experiment. The factorial effect totals appear in col- 
umn (5). The Odds and Evens totals at the foot of each column and the 
totals at the foot of the first half of each column are check data as de- 
scribed in section 5.24a. 

If the factor F were not present, the figures in column (5) would 
represent the factorial effect totals for all main effects and interactions 
of the factors A, В, С, D, and Е. Now reintroduce the factor F. When 
this is done, each effect will have an alias involving Ё. Since the defin- 
ing contrast is ABCDEF, we have A = BCDEF; B = ACDEF; and so. 
on. It is necessary to identify the effects involving F that are to be 
estimated. These are the main effect, Ё, and the interactions AF,BF, 
СЕ, DF, and ЕР. The aliases of these effects are, respectively, 


F=ABCDE; АР =BCDE; ВР = ACDE; 
CF = ABDE; DF = ABCE; EF = ABCD 


Consequently, we identify‘the interaction ABCDE as F, the interaction 
BCDE as AF, and so on. ‘The identification assigned to each of the 32 
factorial effect totals appears on the.right of column (5). 

From the 10 factorial effects that furnish the estimate of error, the 
error mean square per unit can be computed directly as follows, without 
constructing an analysis of variance table. 

2 Зари 2 
Error m.s. = аур! о" ДЕР) = 3089 
(32) (10) 
The divisor is the product of the number of units in the experiment (32) 
and the number of error d.f. (10). The standard error of a single unit 
is 55.6, and the standard error of a factorial effect total is 4/32(55.6) = 
309.6. For 10 d.f., the 5% and 1% t-values are 2.228 and 3.169, re- 
spectively. Hence the two numbers 


(2.228)(309.6) = 690; (3.169)(309.6) = 981 
give the values required for statistical significance at the 5% and 1% 
levels, respectively. The main effect of F is significant at the 1% level, 


as appears obvious from inspection of the original data. Ot the remain- 
ing 20 comparisons that are tested, only the DE interaction reaches the 
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5% level. This result may well be due to chance fluctuations, since one 
apparently significant result out of 20 would be expected. 

For further study of the results, tables of the mean yields for certain 
groups of treatment combinations, e.g., for the lower and upper levels 
of Г, will be wanted. These means сап be computed directly from the 
original yields, or by expressing them as linear functions of the factorial 
effect totals. For example, the four mean yields in the DE two-way table 
are obtained as in table 6A.6. 


TABLE 6А.6 COMPUTATION OF MEAN YIELDS IN THE DE TWO-WAY TABLE FROM 
FACTORIAL EFFECT TOTALS 


Factorial effect total 
Mean for G D E DE Mean 
Combination | 8985 —247 | —283  -715|Divisor| yield 


(2 ЗЕ = = + 32 275.0 
(d) + + = - 82 304.2 
(е) + = + - 82 302.0 
(de) 8 Р + + 32 241.9 


The divisor (32) is the total number of units in the experiment. The 
reader may verify that these results agree with those found by adding > 
the appropriate groups of original yields and dividing by 8. 

Yates’ method is applicable to all fractional faetorials of the 2" series, 
including quarter- and eighth-replieate designs. The general approach 
is to select a sub-set of the factors for which the experiment is a complete 
replicate. With a quarter-replicate, two factors will be ignored tempo- 
rarily: with an eighth-replicate, three factors. The factorial effect totals 
for the sub-set are computed by the Yates’ method, Then the other 
factors are introduced by writing down the alias sets and making the 
i ені identifications in the list of factorial effects found for the 
sub-set. 


64.16 The Reduction of Block Size by Confounding. The designs 
discussed thus far assume that, the experiment is to be completely ran- 
domized. Since arrangements in 16 or 32 units are rather imprecise in 
many types of experimentation, designs in blocks of smaller sizes may 
be wanted. These can be obtained by confounding certain factorial 
effects with blocks. The principles will be illustrated by the 2° factorial 
in 32 units (75 replicate), with ABCDEF as defining contrast. As we 
have seen (section 64,14), all other factorial effects group themselves 
into pairs of aliases: A = BCDEF, AB = CDEF, ABC = DEF, and 
so on. 
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If blocks of 16 units are wanted, any factorial effect other than 
ABCDEF may be used to divide the 32 units into two blocks of 16 units. 
All information on this factorial effect and on its alias will be lost, since 
both are confounded with blocks. The problem is therefore to find а 
pair of aliases both of which can be sacrificed. In the present instance, 
the best choice is clearly a pair of three-factor interactions like ABC 
and DEF. As with fully replicated designs, this confounding does not 
disturb any of the other factorial effects. 

The plan (table 6A.7) is constructed by first taking all treatment 
combinations having an even number of the letters a, b, с, d, e, f to form 
the half-replicate. This set of 32 treatments is divided into two blocks 
such that block 1 contains only even numbers of the letters a, b, or c. 
(It also contains only even numbers of the letters d, e, and f, since ABC 
— DEF.) The partition of the d.f. is shown at the side of the plan. 


TABLE 6A.7 А 14 REPLICATE OF A 26 FACTORIAL IN BLOCKS OF 16 UNITS 
Defining contrast: ABCDEF Confounded with blocks: ABC = DEF 
Block 1 Block 2 


(1) ad 

de ae 

df af 

ef bd 

ab be d.f. 
ac bf Blocks 1 
be cd Main effects 6 
abde ce Two-factor interactions 15 
abdf of Error (from three-factor 

abef adef interactions) 9 
acde bdef - 
асау cdef Total 31 
асе} аха 

bede abce 

bedf abef 

beef abcdef 


For a design in blocks of 8 units, another alias pair must also be con- 
founded with blocks. Suppose that ABD and its alias CEF are selected. 
It will be recalled (section 6.13) that the generalized interaction of this 
alias pair with ABC and DEF is also confounded. This interaction is 
the alias pair CD and ABEF. It is unfortunate that a two-factor inter- 
action is confounded, but further trial shows that this is unavoidable in 
blocks of 8 units. To construct the plan (table 6A.8), divide each block 
of table 6A.7 into halves that contain respectively an odd and even 
number of the letters с and 4. 
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TABLE 6A.8 А HALF-REPLICATE OF A 2° FACTORIAL IN BLOCKS OF 8 UNITS 
Confounded with blocks: ABC, DEF, ABD, CEF, CD, ABEF 


Block 

1 2 3 4 
(1) ае ае ай 
ef df af bd 
ab ac be ce 
abef bc bf of 
acde abde ed abce 
аса] abdf abed abef 
bede acef cdef adef 
bedf beef abedef bdef 


In practice, the letters should be allocated to the factors so that CD 
is the least important two-factor interaction. In the analysis of vari- 
ance, the sums of squares for blocks, main effects, and the unconfounded 
two-factor interactions are all computed in the usual way. If all avail- 
able two-factor interactions are to be estimated, the d.f. subdivide as 
follows: 


d.f. 

Blocks (includes CD) 
Main effects 6 
Two-factor interactions (except CD) 14 
Error 8 
Total 31 


With blocks of 4 units there are 7 d.f. for blocks, so that 7 alias pairs 
must be thrown away. The best solution is to choose АЁ to form 
blocks of 4 units from the blocks of 8 units. It will be found that one 
additional two-factor interaction, BF, is also lost (this being the inter- 
action of АЕ with ABEF). This arrangement (shown in plan 6A.6) 
gives estimates of all main effects and 12 of the 15 two-factor interac- 
tions, but sacrifices CD, АЕ, and BF. 

For blocks of 2 units, investigation shows that all 15 two-factor inter- 
actions must be confounded with blocks if main effects are to be kept 
clear. 

To summarize, the reduction of block size by confounding has the 
same general consequences in fractional factorials as in ordinary fac- 
torials. It becomes impossible to estimate the factorial effects (and their 
aliases) that are confounded with blocks. The rest of the factorial 
effects are undisturbed and are estimated in the usual way (except those 
that serve as defining contrasts). 
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6A.2 The Use of Fractional Factorial Designs in Practice 


64.21 Designs in Which All Main Effects and Two-factor Interactions 
Can Be Estimated. These designs are useful in the following circum- 
stances: 

1. The factors are such that all two-factor interactions must be 
examined. 

2. Experimental errors are substantial, requiring a relatively large 
amount of internal replication to provide satisfactory estimates of the 
main effects and two-factor interactions. 

3. A complete replication is considered too large or is not feasible. 

These conditions sometimes apply when numerous factors are being 
investigated in agricultural field experiments. Most of the uses of frac- 
tional factorials in this area have involved designs that give all, or nearly 
all, the two-factor interactions. 

Table 6A.9 is an index to the plans of this type for experiments with 
up to 8 factors at 2 levels each. In these plans, no main effect or two- 
factor interaction is confounded with blocks, and all aliases of any main 
effect or any two-factor interaction are interactions among at least 3 
factors. 


TABLE 6A.9 FRACTIONAL FACTORIAL DESIGNS IN WHICH ALL MAIN EFFECTS AND 
TWO-FACTOR INTERACTIONS CAN BE ESTIMATED 


(All factors at 2 levels) 
Size of 
No. of experiment Fraction Plan 
factors (units) of a rep. Block sizes no. 
5 16 1 16* 64.3 
6 32 м 32, 16 6A.6 
% 64 M 64, 32, 16, 8 6A.13 
8 64 и 64, 32, 16 64.16 
8 128 и 128, 64, 32, 16 6А17 


* Provides no error d.f. if two-factor interactions are estimated. 


All the experiments are half-replicates except one. The allowable 
block sizes are on the large side. If the size of the experiment is satis- 
factory but the block size is too large, the reader should consult the full 
index (table 6A.10) to see which two-factor interactions must be sacri- 
ficed to permit a further reduction in block size. Sometimes only a few 
are lost in this way. 

In a number of these plans, a large majority of the three-factor inter- 
actions are also estimable. Where this is so, a note about the status of 
the three-factor interactions is given with the plans. If the investigator 
anticipates that he may wish to examine some of the three-factor inter- 
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actions, the plan should be checked in advance to see whether this can 
be done. 

With 9, 10, or 11 factors, 128 units are necessary in order to keep all 
main effects and two-factor interactions estimable. Plans for various 
block sizes are given by Connor (6A.6). 


6A.22 Designs in Which Some Two-factor Interactions Are Sacrificed. 
If a more drastic reduction is made either in the size of the experiment 
or in the size of block, at least some of the two-factor interactions are 
lost, because they are either aliases of other two-factor interactions or 
are confounded with blocks. The number of d.f. available for an esti- 
mate of error also declines, particularly with a reduction in the size of 
the experiment. The situation with regard to two-factor interactions 
and error d.f. varies from plan to plan, and is best appreciated by ex- 
amining some of the plans. 

The designs discussed in this section are especially appropriate when 
the following conditions apply: 

1. Most of the two-factor interactions are likely to be negligible. 

2. Experimental errors are small, making a large experiment unneces- 
sary from the viewpoint of precision. 

3. A good estimate of the standard deviation of the experimental 
errors is available from previous experiments. This condition is not 
essential in all the plans, since some of them furnish enough d.f. for an 
internal estimate of error. 

The attractiveness of a severe reduction in the size of the experiment 
is increased if experiments can be done quickly. In this situation the 
investigator often prefers to keep his experiments small, revising his 
strategy at the end of each experiment in order to avoid doing a lot of 
work on treatments that turn out to have only minor effects. 

The plans cover from 4 to 8 factors, in experiments of sizes from 8 to 
64 units (1.е., treatment combinations to be tested). Block sizes range 
from 4 to 64 units. Table 6A.10 is an index to the plans (including those 
discussed in the previous section). 

Before the table is entered to see if a suitable plan exists, the user 
must have decided upon: 

(i) the number of factors, 
(ii) the size of the experiment, 

(iii) the size of block, 

(iv) the two-factor interactions that can be assumed negligible and 

the two-factor interactions that it is desired to estimate, 

(v) whether an estimate of error must be made from the results of 

the experiment, or whether the error variance is well enough 
known from previous work. 
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TABLE 6A.10 INDEX TO PLANS FOR 2" FACTORIALS IN FRACTIONAL REPLICATION 


Frac- 
No. of tion of 
factors а rep. 


4 
5 


м 


xx 


x 


ЖЖ 


Size 


of 


expt. 


8 


8 
16 


32 


64 


16 


32 


61 


128 


‘Two-factor 
interactions Error d.f. 
— — ` 
Max. 2-factors 2-factors 
no. esti- used as esti- 
Total  mable error mated 
6 3* 3 0 
10 2% 2 0 
10 10 10 0 
10 9 9 0 
10 7 7 0 
15 (MJ 1 0 
15 ue 9 2 
15 де, 8 1 
15 6* 6 0 
15 15 25 10 
15 15 24 9 
15 14 22 8 
15 12 18 6 
21 0 0 0 
21 T* 8 1 
21 Tit 7 0 
21 4* 5 1 
21 181 24 6 
21 18 | 23 5 
21 17 21 4 
21 141 17 3 
21 21 56 35 
21 21 55 34 
21 21 53 32 
21 21 49 28 
21 15 41 26 
28 qus. 7 0 
28 6 * 6 0 
28 4* 4 9 
28 20 ł 23 3 
28 191 22 3 
28 171 20 3 
28 13 16 3 
28 28 55 27 
28 28 51 26 
28 28 52 24 
28 26 18 22 
28 21 10 19 
28 28 119 9t 
28 28 118 90 
28 28 116 88 
28 28 112 84 
28 26 104 78 


Plan 
no. 


6A.11 


оооо oOococoocoooo 
SESE РЕЕРЕРРЕ 
moon © С› < © бл л ль 


* All these interactions have other 2-factor interactions as aliases, and are esti- 
mable only if the aliases can be considered negligible. See plans for details. 
1 Some of the interactions have other 2-factor interactions as aliases. See plans 


for details. 


i Except in unusual circumstances, only main effects can be estimated with this 


design, 


262 FACTORIAL EXPERIMENTS IN FRACTIONAL REPLICATION 6A.2 


As regards (iv), informed guesses about the importance of the two- 
factor interactions can frequently be made from previous experimenta- 
tion or from knowledge of the way in which the factors produce their 
effects. ‘The greater the number of two-factor interactions that are un- 
important, the better the chance that a suitable plan exists. It is worth 
noting that interactions can sometimes be reduced іп magnitude by a 
skillful choice of the factors or of the scale in which results are analyzed 
and interpreted. The sizes of the differences between the two levels of 
the factors are also relevant. With small changes in levels that produce 
moderate or small main effects, the assumption that the effects of the 
factors are additive is less likely to be seriously wrong than when the 
experiment explores а, wide range of variations in levels. 

Ав regards the error variance, the provision of a self-contained esti- 
mate of error and of valid tests of significance is one of the principal 
strengths of modern experimental design. If the use of an estimate of 
error from previous experiments is contemplated, some preliminary 
questions should be answered. Were the previous experiments con- 
ducted on similar experimental material, under similar conditions of 
experimentation and with treatment effects of about the same order of 
magnitude? Were the previous error variances homogeneous from one 
experiment to another, as indicated by a test of homogeneity of vari- 
ances? If these questions are answered in the affirmative, the risk in 
taking an external estimate of error is decreased. 

The use of table 6A.10 will be illustrated by a few examples. To start 
with an experiment that has already been briefly discussed, suppose that 
there are 6 factors, to be tested in 32 units (14 replicate) and that a 
block size of 4 units is advisable for precise comparisons. The experi- 
ment will then consist of 8 blocks of 4 units each. Preferably, all the 
two-factor interactions are to be estimable, although a few of them may 
be non-existent. The experiment is to furnish its own estimate of error. 

Entering table 6A.10 for 6 factors, size of experiment = 32, and block 
size = 4, we see that 12 of the 15 two-factor interactions can be esti- 
mated. To discover which interactions must be sacrificed we refer to 
the plan (plan 6A.6). The 3 two-factor interactions that are lost are 
AH, BF, and CD, which are confounded with blocks. This plan is 
practicable with respect to two-factor interactions if the letters can be 
assigned to factors so that these three interactions are unlikely to be 
appreciable. It is unfortunate that the three interactions involve all 
6 factors. 

The situation with regard to error d.f. is shown in the two columns of 
table 6A.10 headed “Error 44.” The right hand column of this pair 
gives the number of error d.f. that the experiment provides when all 
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available two-factor interactions are estimated. In this case there are 
6 d.f. The left hand column shows the number of error d.f. when all 
two-factor interactions are assumed small and are pooled with the error. 
For this plan we would have 18, 1.е., (6 + 12) error d.f. in that event. 
Ап intermediate course may be adopted, some of the 12 d.f. for two- 
factor interactions being estimated and the rest pooled with the error. 

To summarize, this plan gives independent estimates of all main 
effects and nearly all two-factor interactions, plus a small but tolerable 
number of d.f. for the estimate of error. 

To take an example with a larger reduction in the size of the experi- 
ment, suppose that 7 factors are to be investigated in 16 tests (1g repli- 
cate), with block size 8. Table 6A.10 shows that at most 7 of the 21 
two-factor interactions are estimable. Moreover, the asterisk attached 
to the number 7 is a warning that these 7 are all aliases of other two- 
factor interactions, and are estimable only if their two-factor aliases 
can be assumed negligible. Detailed scrutiny of the aliases (given with 
plan 64.9) is needed to reach a verdict оп this point. 

The “Error d.f.” columns of table 6A.10 reveal that 7 error d.f. are 
obtainable if all two-factor interactions are included with the error, but 
none if two-factor interactions are isolated. 

We conclude that if the experiment is to provide its own estimate of 
error, this plan is good only for estimating main effects under conditions 
in which all interactions are negligible. 

For 5, 6, and 7 factors the plans with the greatest reductions in the size 
of the experiment are those in experiments that include only 8 treat- 
ment combinations. In these plans all two-factor interactions must be 
negligible (since two-factor interactions are aliases of main effects) and 
an external estimate of error must be available. If all two-factor inter- 
actions are unimportant but an internal estimate of error is a necessity, 
the plans with size of experiment equal to 16 units may be suitable. 
Additional plans of this type in 12, 20, and 24 units in the experiment 
can be obtained in (6A.2). 


6A.23 The Hazards of Fractional Replication. As we have seen, the 
results of an experiment with fractional replication may be misinter- 
preted if some of the interactions that have been assumed negligible 
are not so. Since the extent of this danger will vary from case to case, 
it is hard to present an overall picture of it. The following example, 
although it over-simplifies the issue, exhibits the nature of the problem. 

There are 3 factors. A has no effect, while B and C have positive 
effects and a positive interaction. We shall suppose that B gives an 
increase of 40 when applied at the lower level of C and of 60 at the upper 
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level of С, while С gives increases of 20 and 40 at the lower and upper 
levels of B, respectively. With a basal level of 100 for the (1) combina- 
tion and no experimental error, the results for the 8 treatment combina- 
tions are found to be as follows: 


TABLE 6A.11 HYPOTHETICAL RESULTS IN A 2° EXPERIMENT 


(a) 100 (1) 100 
(5) 140 (а) 140 
(с) 120 (ас) 120 
(abe) 180 (bc) 180 


The experimenter, of course, does not know these results. He runs a 
half-replicate, with ABC as defining contrast. The results that he ob- 
tains consist of those in the left column of table 64.11 if 4- АВС is chosen, 
and those in the right column if —АВС is chosen. 

The two sets of main effects are estimated as follows: 


From +ABC; 
_ (во) + (а) — 0 =O) _ 


А 1 +10; В = +50; С = +30 
From —АВС: 

(ab) + (ac) — (be) — (1 
44 E9109 о, B = +50; С = +30 


The two half-replicates give the same results for В and for С: these 
are in fact the correct responses to B and C over all 8 treatment com- 
binations. The first half-replicate shows an erroneous increase of 10 
to A, while the second half shows an equal decrease to А. Neither set 
of results gives a warning about the presence of a BC interaction (unless 
the experimenter speculates that this might be one interpretation of the 
results). 

How much harm is done depends on the decisions made from these 
results, Three situations will be considered briefly. 

(i) If the experiment is primarily of the “screening” type, designed 
to pick out the most important factors, the conclusion may be drawn 
that B and C are important but A is relatively unimportant. Further 
experimentation is confined to В and С and presumably reveals the BC 
interaction. Little or по harm Ваз been done. 

Gi) If the experiment is part of a program of fundamental research, 
the investigator wrongly concludes either that A has a beneficial effect 
or that it has a deleterious effect. In the absence of further experiments, 
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energy may be dissipated in creating incorrect theories to explain the 
“effect” of A or in disputes with colleagues. 

(iii) In a program of applied research, the object may be to select the 
best combination of levels of the factors. In the first half-replicate, 
(abe) gives the highest observed yield. If the results of this half-repli- 
cate are used to speculate about the response from the combinations 
(ab), (ac), and (bc) that were not tested, all these treatment combina- 
tions will be predicted to be inferior to (abc). For instance, (bc) is pre- 
dicted as 10 units poorer than (abc), since A gives an increase of 10 in 
this half-replicate. Hence, (abc) is the recommended optimum. 

If the second half-replicate is chosen, (bc) gives the best observed 
yield. It will be predicted to be superior to a combination like (abc) 
that was not tested, since A produces a decrease of 10 in the second half- 
replicate. 

Actually, both choices (abc) and (bc) give the correct highest yield, 
180. The confusion is about the level at which A must be set. Ifthe 
level at which А is put involves a substantial difference in cost, and if 
the investigator argues as above without further experimentation, he 
stands a chance of recommending an unnecessarily expensive “optimum.” 

From the foregoing it is clear that one cannot make flat statements 
about the danger of misinterpreting the results of a fractional factorial. 
The penalties may be trivial, or they may be more serious. The example 
discussed above is not necessarily typical: examples in which more dif- 
ficulty is created by the use of fractional replication can easily be con- 
structed. In general, it seems wise not to lean heavily on fractional 
replication as a weapon for investigation unless the risk of confusion 
from the presence of factor interactions is judged to be small. 


6A.24 Fractional Replication in Time. Davies and Hay (6A.7) have 
pointed out that fractional factorials may save effort in lines of work 
where an experiment can be conducted one part at a time. Suppose 
that a complete replication of a 2* experiment is planned, and that 8 
treatment combinations can be processed at one time, say in a week. 
The obvious procedure is to divide the replicate into two blocks of 8, 
confounding ABCD between blocks. In the fractional factorial ap- 
proach, we stop when the first block (half-replicate) has been completed, 
and decide from the results whether to continue with the second block 
or to change the direction of experimentation. If A alone shows a defi- 
nite response, we may decide that further experimentation with these 4 
factors is unjustified. On the other hand, if A, B, and the alias pair 
AD = BC all give marked effects, we complete the replicate in order to 
obtain independent estimates of AD and BC so as to elucidate the na- 
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ture of these interactions. Similarly, if a half-replicate of a large fac- 
torial experiment is envisioned, it may be time-saving to start with a 
quarter-replicate and decide on the next step when the results of this 
part are known. 

To save space, the plans in this book present only one specific fraction 
of the replicate. If the experimenter decides to follow one quarter-repli- 
cate with another, a rule is needed for writing down the plan for the 
second quarter-replicate. The rule is simple. To obtain another frac- 
tional replicate, multiply all the treatment combinations in the plan by 
any combination of letters that does not appear in the plan, with the 
usual convention that the square of any letter is replaced by 1. For 
example, the following are the 8 treatment combinations in plan 6A.2 
for a 2° factorial in an experiment of size 8: 


(1) ab cd ace bee ade bde abcd 
Multiplication by a produces a second quarter-replicate 
а b acd ce abce de abde bed 


To generate a third quarter, multiply the first set by any combination of 
letters that does not appear in either of the first two sets. Thus e will 
do, but not b or acd. 

If the second quarter-replicate is being run in order to disentangle 
some aliases, it must be chosen with care. The following example illus- 
trates the general procedure. In the first quarter-replicate above, the 
defining contrasts are ABE, CDE, and ABCD. When we create another 
quarter-replicate, the half-replicate so formed will have as its defining 
contrast one of the three effects ABE, CDE, or ABCD. By an appro- 
priate choice of the second quarter, we can make any specified one of 
these three effects the defining contrast. 

In the first quartér-replicate, main effects are entangled with two- 
factor interactions (e.g., A = BE, C = DE). If the second quarter is 
undertaken in order to free the main effects from this entanglement, we 
must have ABCD as the defining contrast for the half-replicate, because 
the choice of ABE or CDE still leaves some main effects with two-factor 
interactions as aliases. To form the correct second experiment, multiply 
the treatment combinations in the first by any combination of letters 
that 


(i) does not appear in the first fraction and 


Gi) has an even number of the letters in the defining contrast that we 
want to have. 
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For instance, е is satisfactory as a multiplier, having no letters in com- 
mon with a, b, c, d, but a is not. 

The reasoning behind this rule is as follows. In the plans we have 
always shown the fractional replicate in which all treatment combina- 
tions have an even number of letters in common with each of the defining 
contrasts. (This fraction is called the principal block.) Consequently, 
if ABCD is to remain a defining contrast, all treatment combinations 
іп the half-replicate must have an even number of the letters а, b, c, 
and d. Multiplying the principal block by an even number of these 
letters, as in condition (ii) above, achieves this end. 


6A.25 Randomization. When the plan has been selected, the steps are: 

1. Randomize the blocks. 

2. Randomize the treatment combinations within each block. 

It is also preferable to select the fractional replication at random 
from those that can be generated by the set of defining contrasts that 
are being used. With a half-replicate, for instance, we decide at random 
which of the two halves is used. This step increases the validity of tests 
of significance made on the results, and if similar experiments are being 
conducted in different places it provides a wider sampling of the possible 
treatment combinations. 

As we have mentioned, the plans present only one specific fraction 
of the replicate. Although the other fractions can be written down by 
the rules given in the preceding section, the following alternative method 
avoids this step, using the plan as given. 

With a half-replicate, decide at random for one of the factors whether 
the presence of the letter is to represent the first or the second level. 
With factors that are quantitative, it is natural for the experimenter to 
let the letter, say a, denote the higher level. Consequently, this random 
choice should be made for a qualitative factor, if there is one in the ex- 
periment, or more generally for the factor that will cause the least con- 
fusion to the experimenter. The reader may verify, for any of the half- 
replicate plans, that this choice represents a random selection of one of 
the two possible halves. 

For a quarter-replicate, assign the letter to the levels at random for 
any two of the factors, provided that this pair of letters does not occur 
as a treatment combination in the plan. To illustrate, in plan 6A.5 for 
6 factors in a 14 replicate, blocks of 4 units, the only pairs of letters that 
occur are ab, ce, and df. Any pair other than this trio (e.g., ae, bc) 
will do. For an eighth-replicate, any 3 of the factors whose letter com- 
bination does not appear in the plan may have their levels assigned at 
random to letters. 
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6A.26 Statistical Analysis. If there is no confounding with blocks, 
the procedure in section 6A.15 should be followed. When the experi- 
ment is arranged in blocks, the factorial effects divide themselves into 
three groups which must be clearly distinguished. 

(i) The effects that are to be estimated. Each plan shows which two- 
factor interactions are estimable: this information is also given for 
three-factor interactions in the larger designs. 

(ii) The effects that are confounded with blocks. These effects are 
also given, either under the plan or above the analysis of variance in 
cases where a plan is constructed by combining the blocks from a pre- 
vious plan. 

(iii) The effects that are to be used for an estimate of experimental 
error. 

The example in section 6A.15 on the texture of cake icing will be used 
to provide an illustration for a half-replicate of a 2° factorial, confounded 
in four blocks of 8 units. In table 6A.12 the data for this experiment 
are arranged in the blocks given by plan 6A.6. Artificial block effects 
of —40, +10, 0, and +30 per observation have been superimposed on 
the data. 


TABLE 6A.12 А 2° FACTORIAL (14 REPLICATE) IN BLOCKS OF 8 UNITS 


Texture of icing after 72 hours 


() (2) (3) (4) 
(1) 193 ас 243 ае 350 аа 297 
ab 277 be 343 af 217 bd 430 
ef 160 de 277 be 283 ce 430 
abef 110 df 260 bf 267 cf 280 
acde 327 abde 360 са 400 abce 413 
аса/ 210 аһа 277 abed 317 abcf 380 
bede 227 acf 343 cdef 117 айе. 230 
bedf 243 beef 327 abcdef 217 bdef 180 


CD, ABC, ABD confounded. 


The analysis proceeds as in section 6А.15. The factor F is temporarily 
suppressed. The remaining factors, which form a complete replicate, 
are arranged in systematic order for the application of Yates’ method 
(table 64.13). When the factorial effect, totals have been obtained by 
this method (column 5), it remains to reintroduce F and identify each 
one appropriately. 
| We shall suppose that all main effects and the 14 available 2-factor 
interactions are to be estimated (CD is lost by confounding). Тһе identi- 
fications are given on the right of table 6A.13. Be sure that the blocks 
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TABLE 6А.13 COMPUTATION OF FACTORIAL EFFECT TOTALS 
Treat- 
ment 
combi- 
nation Texture (1) (2) (3) (4) (5) Identification 
(1) 193 410 954 2200 4634 8985 G 
a(f) 217 544 1246 2434 4351 л Л 
b(f) 207 523 1264 2416 — —198 317 В 
ab 277 723 1170 1935 349 -4 АВ 
e(f) 280 557 903 34 434 649 С 
ac 243 707 158 -92  -17 15 AC 
bc 343 610 1047 16 1344  -83 BC 
аў) 380 560 888 333 —183 631 АВС (Blocks) 
d(f) 260 510 34 334 198  —247 р 
ad 297 393 0 100 451 51 AD 
bd 430 773 -П6 -150 -34 -51 BD 
abd(f) 277 740 —116 33 49 21 ABD (Blocks) 
cd 400 507 17 60 —134 -1155 CD (Blocks) 
acd(f) 210 540 -1 74 51 119 АСР (Error) 
Ьсі) 243 444 133 —190 542  —383 ВСР (Error) 
abed 317 444 200 7 89 —617 EF = ABCD 

8985 9136 9404 10616 8544 
e(f) 160 24 134 292 284 —283 Е 
ае 350 10 200 —94 —481 541 АЕ 
be 283 | —37 150 610 -26 -51 BE 
abe(f) 110 37 -50 -159 817  —317 АВЕ (Error) 
се 430 37 -1n7 —34  -234 253 СЕ 
ace(f) 343  —153 -33 0 183 83 ACE (Error) 
bee(f) 827 -190 83 -18 14 185 ВСЕ (Error) 
abce 413 74 0 67 197 -—453 DF = ABCE 
de 277 100 —14 66 -3%8 -Т5 DE 
ade(f) 230  —173 74 —200 -769 583 ADE (Error) 
bde(f) 180  —87  —190 84 34 417 ВРЕ (Error) 
abde 360 86 264 —33 85 188 CF = ABDE 
cde(f) 117  —47  —303 88 -266  —383 ОРЕ (Error) 
acde 327 180 173 454 —117 51. BF = ACDE 
bede 227 210 227 536 366 149 AF = BCDE 
abcde(f) 217 -10 Á—220  —447 -93 -—1349 Р = ABCDE 
Odds 4417 4434 4096 6344 5072 
Evens 4568 4702 5308 4272 3472 
8985 9130 9404 10616 8544 6944 
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are correctly identified. It may be noted that the numerical values of 
all factorial effect totals are the same in table 6A.13 as in table 6A.5, 
except for the three effects CD, ABC, and ABD that are confounded 
with blocks. This is as it should be. 

In the analysis of variance (table 6A.14), the sums of squares for 
blocks, main effects, estimable two-factor interactions, and error are all 
obtained directly by adding the squares of the appropriate factorial 
effect totals, and dividing by the number of units in the experiment (32). 
Thus 


631)? + (211)? + (1155)? 
Blocks вв. = а = 55,522 


TABLE 64.14 CONDENSED ANALYSIS OF VARIANCE 


d.f. 8.8. m.s. 
Blocks 3 55,522 18,507 
Main effects 6 78,293 13,049 
2-factor interactions 14 57,405 4,100 
Error (from 3-factors) 8 30,091 3,761 
Total 31 221,311 


Alternatively, the analysis of variance can be dispensed with by 
finding the error mean square directly. From this, the standard error 
of a factorial effect total is computed, and tests of significance are per- 
formed directly on the effect totals in table 6A.13. 


6A.3 Designs with Factors at More Than Two Levels 


6A.31 Index to Plans. When some or all of the factors contain more 
than 2 levels, most of the fractional replication designs have two-factor 
interactions heavily mixed up with one another. We shall confine our- 
selves to a few designs in which all or nearly all of the two-factor inter- 
actions can be estimated. Table 6A.15 gives an index to the designs for 
which plans are presented. 


TABLE 6A.15 DESIGNS WITH FACTORS АТ MORE THAN 2 LEVELS 


Fraction Size of Block 
Type ofarep. experiment sizes Plan 
3i и 27 27,9 6A.18 
35 и 81 81, 27, 9 6А.19 
3x2 и 48 48, 24, 12 Section 64.35 
4x2 и 32 32, 16, 8 6А.20 
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A number of these arrangements have been used in agricultural re- 
search. Chinloy et al. (6A.8) describe the application of a 3° design, а 
14 replicate, in blocks of 9 plots, in an experiment on the manuring 
of sugar cane, the factors being nitrogen, phosphorus, potash, filter press 
mud, and bagasse (the crushed stalks minus the cane juice), each at 3 
levels. А 1$ replicate of a 37 factorial was used by Tischer and Kemp- 
thorne (6A.4) to study 7 factors that might affect the performance 
of the Adams consistometer, an instrument for measuring the con- 
sistency of canned food. This experiment required 37/9, or 243, differ- 
ent treatment combinations to be tested. Since 27 combinations could 
be handled in a day, the experiment was arranged in blocks of 27 units. 

The following sections give notes on the plans and mention some other 
possibilities. 


6А.32 The 3" Series. The problems of construction can be illustrated 
from а 14 replicate of a 3° factorial. It will be recalled (sections 6.15, 
6.16) that in a 3° factorial the 26 d.f. for the treatments sum of squares 
can be split into 13 pairs of d.f., as follows: 


Main effects: A, B, C. 
Two-factor interactions: AB(I), ABV), АС(7), ACV), ВС(1), BC(J). 
Three-factor interactions: ABC(W), ABC(X), ABC(Y), ABC(Z). 


To obtain a 14 replicate it is natural to choose a component of ABC 
as the defining contrast. This is easily done with the aid of plan 6.7 
(р. 238), in which four replicates of a 3° design are shown in blocks of 
9 units. In each replicate, a different component of ABC is confounded 
betiveen blocks. Replicate I confounds ABC(W). If any single block 
of this replicate is selected, we obtain a 15 replicate with ABC(W) ав 
the defining contrast. Table 6A.16 shows the plan given by the first 
block. 


TABLE 6A.16 A }4 REPLICATE OF A 3° DESIGN 
Defining contrast: ABC(W) 


000 
110 
220 
101 
211 
021 
202 
012 
122 
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For this plan, the alias system works out as shown below. 
А = BC(J) = АВС(2) 
В = AC(I) = ABC(Y) 
С = AB(I) = ABC(X) 
АВ() = АС) = BC) 


To verify the aliases of A, note that the main effect of A is computed 
by comparing the totals for the 3 levels of A. These totals are: 


Ао = 000 + 021 + 012 
A; = 101 + 110 + 122 
А» = 220 + 211 + 202 


Now consider the experiment with reference to В апа С, ignoring А. 
From section 6.15, where the J and J components of a two-factor inter- 
action are defined, it is seen that the three totals above represent re- 
spectively the Jı, J2, and J3 components of BC. Thus 


А = BC) 


The three-factor interaction ABC(Z) is shown in replication IV of 
plan 6.7, where it is confounded with blocks. If we compute ABC(Z) 
from the 9 treatment combinations that are available in the 14 repli- 
cate, it will again be found that this is the main effect of A. The other 
alias relationships can be checked in the same way. 

Owing to the confusion between main effects and two-factor interac- 
tions, this plan is not of practical use unless all two-factor interactions 
are negligible relative to main effects. 

For a 3* factorial, plan 6A.18 gives a 14 replicate in 27 observations, 
with a component of ABCD as defining contrast. Main effects are 
clear of two-factor interactions, but some confusion still exists among 
the latter, i.e., 


АВ() = ОРО); АС(1)- ВБИ); BCM) = ADD 


This plan may serve the purpose if one of the factors, say D, does 
not interact with any of the others. The 3 alias pairs above are then 
regarded as estimating AB(J), AC(I), and BC(I), respectively. Under 
this assumption, full information on the two-factor interactions of A, B, 
and C is obtained. The analysis of variance runs as follows: 
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"Effects d.f. 
Main 22 8 
АВ, АС, ВС 12 


Error (from AD, ВР, and Ср) 6 
Total 26 


If only the linear X linear components of the AB, AC, and BC interac- 
tions (section 5.26) appear likely to be appreciable, 9 additional d.f. 
can be included in the estimate of error. 

To obtain a design in blocks of 9 units, 2 d.f. representing two-factor 
interactions must be confounded. The best pair to sacrifice depends on 
the nature of the experiment. In plan 6A.18 we have confounded CD(1) 
with blocks, on the grounds that this design seems most useful when D 
does not interact with the other factors. The separation of d.f. is shown 
with the plan. 

With the 3° factorial (plan 6A.19), all main effects and two-factor 
interactions are estimable in a 15 replicate, with blocks of either 27 
or 81 units. If the block size is reduced to 9 units, 2 d.f. from AE are 
lost by confounding with blocks. 

In the analysis of variance for 3* and 3° designs, all main effects and 
unconfounded two-factor interactions are computed in the usual way by 
setting up the necessary two-way tables. 

In the 3" system, as in the 2" system, there are rules by which the 
aliases of any factorial effect can be discovered when the defining con- 
trasts have been selected, without having to construct the plan (6A.1, 
6A.4). 


6A.33 The 4" Series. Since experiments with all factors at 4 levels 
do not appear to be common, we have not presented plans for the 4" 
series. It is worth noting, however, that plans with some or all factors 
at 4 levels can be constructed quickly from the 2" plans. Let the 4 
levels of a factor A be denoted by ao, ал, a», and аз. As mentioned on 
р. 227, the three d.f. for the main effect of А can be partitioned into 
three orthogonal single d.f. 


А’ = (as) + (a2) — (а) — (в) 
А” = (аз) — (az) — (а) + (ао) 
Al” = (аз) — (az) + (а) — (ao) 


Suppose now that we start with two different factors С and D, each at 
two levels. Make the transformations 


сі = аз; с=0; (І)-а; d= a 
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Then the reader may verify that the main effect of C becomes identical 
with A’ as just given. Similarly, D = A”, and the interaction CD = 
А”. Thus С, D, and their interaction CD are transformed into the 3 
d.f. for the main effect of A. 

By this device, any pair of factors in a 2" plan can be replaced by a 
single factor at 4 levels. Take, for instance, the half-replicate plan for 
a 28 factorial, with ABCDEFGH as defining contrast, and replace the 
pair A, B by I; C, D by J; E, F by К; and G, Н by L, where J, J, K, 
L are at 4 levels each. The plan becomes a half-replicate of a 4* fac- 
torial, with ГГ" К "1 as defining contrast. The alias relationships 
in the 44 design are easily found from these in the 25 design. The equal- 
ity ABCD = EFGH, for example, becomes 1747” = КІ)”, Note 
that IJ" is a component of the two-factor interaction of J and J in 
the 4-level system, and likewise for K/"L/", so that the 44 design retains 
some confusion among the two-factor interactions. 

When using this method, remember that some two-factor interac- 
tions in the 2” system become main effects in the 4” system, while some 
three- and four-factor interactions become two-factor interactions. The 
alias system should be carefully checked before adopting a design. 


64.34 Hyper-graeco-latin Squares As Fractional Factorials. When all 
factors have 2 levels, we have seen (section 6A.22) that the main effects 
of as many as 7 factors can be estimated independently in an experiment 
with only 8 treatment combinations, provided that all interactions are 
negligible. When factors have more than 2 levels, designs that give 
the maximum reduction in size of experiment are obtained from graeco- 
latin squares (section 4.51), or more generally from sets of completely 
orthogonal latin squares. Table 6A.17 shows the 3 X 3 graeco-latin 
square, rewritten (on the right) as a plan for a 3* factorial in 9 units 
(1$ replicate). 


TABLE 6A.17 А Қ REPLICATE OF А 34 FACTORIAL 


Graeco-latin square Fractional factorial 
Ау Вз Co 0000 0112 0221 
B» Сі 4; 1011 1120 1202 
C; Аз Bi 2022 2101 2210 


In the rewritten form, the rows of the square represent the first factor, 
columns the second, latin letters the third, and suffixes the fourth. АП 
main effects of the 4 factors are orthogonal to one another. Since each 
main effect carries 2 d.f., the four main effects account for all the 8 
d.f. in the total sum of squares among the 9 treatment combinations in 
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the experiment. This result shows that 4 is the greatest number of 
factors at 3 levels each for which main effects can be estimated in 9 
units. This estimation is possible only if all interactions are absent. 

For factors at 4 levels, there exists a set of three completely orthogonal 
4 X 4 latin squares (6А.9). An experiment with 16 units will therefore 
accommodate 5 factors each at 4 levels, this being 1/64 replicate. Simi- 
larly, there is a design with 25 units (1/625 replicate) which gives esti- 
mates of 6 factors each at 5 levels. 

These results are presented for their mathematical interest, not be- 
cause the designs are to be recommended. Since main effects have two- 
factor interactions as aliases, such designs can rarely be used with safety. 
Even in the less extreme case in which the rows, columns, and letters of 
an ordinary p X p latin square represent different factors (giving а 1/p 
fraction of a р? factorial), main effects still have two-factor interactions 
as aliases, as was verified in section 6A.32 for the 33 factorial. Youden 
and Hunter (6A.10) have warned against the misuse of this design and 
have shown how to include some genuine replication in it. 


64.35 3 x 2” and 4 X 2" Factorials. If one factor is to be tested at 
3 levels and all others at 2 levels, a design can be obtained by an ex- 
pansion of the corresponding 2" design. Take, for instance, plan 6A.3 
for a 2° factorial in 16 units (1% replicate). If we call the additional 
3-level factor F, and test all these 16 treatment, combinations at each 
level of F, we have a 3 X 2° factorial in 48 units (14 replicate). The 
block sizes, of course, become three times as large as those in plan 6A.3. 
The two-factor interactions that are estimable are exactly the same as 
in that plan, CD, CE, and DE being lost in blocks of 12 units and DE 
in blocks of 24 units. The same method gives a 3 X 2° design in 96 
units. 

Plan 6A.20 shows a 4 X 2" factorial in 32 units (14 replicate). АП 
two-faetor interactions are estimable except in blocks of 8 units, in 
which 1 d.f. must be sacrificed. Plans for 4 X 2° and 42 X 2* factorials 
сап be constructed from the 2” series of plans as explained in section 
6A.33, but they are not presented here. 
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PLANS 
Plan 64.1 2 factorial in 8 units (14 replicate) 
Defining contrast: ABCD 
Estimable 2-factor interactions: AB = CD, AC = BD, AD = BC. 


(1) 

ab Effets 44. 
ac Main 4 
ad 2-factor 3 
be ч 
ba Total 7 
са 


abed 


Plan 6A.2 


PLANS 
25 factorial in 8 units (34 replicate) 
Defining contrasts: ABE, CDE, ABCD 
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Main effects have 2-factors as aliases. The only estimable 2-factors are AC = BD 


and AD = BC. 
а) 
ab Effects d.f. 
cd Main 5 
ace 2-factor 2 
bce - 
айе Total 7 
bde 
abed 
Plan 6A.3 25 factorial in 16 units (24 replicate) 
Defining contrast: ABCDE 
Blocks of 4 units 
Estimable 2-factors: All except CD, CE, DE (confounded with blocks), 
Blocks (1) (2) (3) (4) Effects d.f. 
——————————— Block 3 
(1) ac ae ad Main 5 
ab іс be bd 2-factor 7 
acde de cd се — 
bcde ае abcd abce Total 15 


CD, CE, DE confounded. 


Blocks of 8 units 
Estimable 2-factors: All except DE. 


Blocks of 16 units 
Estimable 2-factors: All. 


Combine blocks 1 and 2; and blocks 3 Combine blocks 1-4. 


and 4. ДЕ confounded. 


Effects 
Block 
Main 
2-factor 


Total 


d.f. 
1 
5 
9 


Effects d.f. 
Main 5 
2-factor 10 

Total 15 
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Plan 64.4 25 factorial іп 8 units (% replicate) 
Defining contrasts: ACE, ADF, BCF, BDE, ABCD, ABEF, CDEF 


Main effects have 2-factors as aliases. The only estimable 2-factor is the set AB = 
CD = EF. 


(1) 
acf Effects d.f. 
ade Main 6 
bce 2-factor (АВ = CD = EF) T 
bdf = 
аса Total 7 
abef 
cdef 

Plan 64.5 28 factorial in 16 units (14 replicate) 

Defining contrasts: ABCE, ABDF, CDEF 


Blocks of 4 units 


Estimable 2-factors: The alias sets AC = BE, AD = BF, AE = BC, AF = BD, 
CD = EF, CF = DE. 


Blocks (1) (2) (3) (4) Effects d.f. 
= Se — Block 3 
(1) acd ab ас] Main 6 
abce aef ce ade 2-factor 6 
abdf bcf df bcd = 
cdef bde abcdef bef Total 15 


AB, ACF, BCF confounded. 


Blocks of 8 units Blocks of 16 units 


Estimable 2-factors: Same as in blocks Estimable 2-factors: Same as in blocks 
of 4 units, plus the set АВ = CE = DF. of 8 units. 


Combine blocks 1 and 2; and blocks 3 — Combine blocks 1-4. 
апа 4. ACF confounded. 


Effects d.f. Effects d.f. 
Block 1 Main 6 
Main 6 2-factor 7 
2-factor 7 3-factor 2 
3-factor M — 

= Total 15 


Total 15 
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Plan 6A.6 25 factorial in 32 units (14 replicate) 
Defining contrast: ABCDEF 
Blocks of 4 units 
Estimable 2-factors: All except АЕ, BF, and CD (confounded with blocks). 
Blocks (1) (2 (3) (4) G (6) ( (8) 


(1) ab ас bc ae af ad bd 
abef ef de df bf be ce cf 
acde аса) abdf асе] cd abcd абс] асе 
bedf bede beef abde abcdef cdef bdef adef 


AE, BF, CD, ABC, ABD, ACF, ADF confounded. 


Effects d.f. 
Block 7 
Main 6 
2-factor 12 
Higher order 6 

Total 31 

Blocks of 8 units Blocks of 16 units 
Estimable 2-factors: All except CD. Estimable 2-factors: All. 


Combine blocks 1 and 2; blocks 3 and 4; Estimable 3-factors: ABC = DEF is lost 
blocks 5 and 6; and blocks 7 and 8. by confounding. The others are in alias 


CD, ABC, ABD confounded. pairs, e.g, ABD = СЕР, 
Combine blocks 1-4; and blocks 5-8, 
ABC confounded. 
Effects d.f. Effects d.f. 
Block 3 Block 1 
Main 6 Main 6 
2-factor 14 2-factor 15 
Higher order 8 3-factor 9 
Total 31 Total 31 
Blocks of 32 units 


Estimable 2-factors: All. 
Estimable 3-factors: These are arranged in 10 alias pairs. 
Combine blocks 1-8. 


Effects df. 
Main 6 
2-factor 15 
3-factor 10 


Total 31 
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Plan 6A.7 27 factorial in 8 units (б replicate) 


Defining contrasts: ABG, ACE, ADF, BCF, BDE, CDG, EFG, ABCD, ABEF, 
ACFG, ADEG, BCEG, BDFG, CDEF, ABCDEFG 


Main effects have 2-factors as aliases. No 2-factors are estimable. 


abed 
abef Effects 44. 
acfg Main 7 
adeg - 
bceg Total 7 
bdfg 
cdef 
Plan 64.8 27 factorial in 16 units (16 replicate) 
Defining contrasts: ABCD, ABEF, ACEG, ADFG, BCFG, BDEG, CDEF 


Blocks of 4 units 


Estimable 2-factors: Only the alias sets AE = BF = CG; AF = BE = DG; AG = 
CE = DF; BG = DE = CF. 


Blocks (1) (2) (3) (4) Effects d.f. 
----------- Block 3 
(1) ад ај ade Main 7 
efg cdg bdf bce 2-factor 4 
abed abef aceg adfg Higher order 1 
abcdefg cdef bdeg Ысјд — 
Total 15 
АВ = CD = EF, 
АС = BD = EG, 


AD = BC = FG confounded. 


Plan 64.9 27 factorial in 16 units (1% replicate) 
Defining contrasts: ABCD, ABEF, ACEG, ADFG, BCFG, BDEG, CDEF 
Blocks of 8 units 
Estimable 2-factors: Same as in blocks of 4 units, plus the alias sets AB = CD = 
ЕР, АС = Вр = EG, AD = BC = FG. 


Blocks (1) (2) 


(1) ам Effects d.f. 
abcd acf Block 1 
abef ade Main 7 
aceg bce 2-factor 1. 
adfg baf = 
befg cdg Total 15 
bdeg efg 
cdef abcdefg 
ABG confounded. 

Blocks of 16 units 


Estimable 2-factors: Same as in blocks of 8 units. 
Combine blocks 1 and 2 of the plan for blocks of 8 units, 


Effects df. 
Main 7 
2-factor 7 
Higher order 1 
Total 15 
Plan 6A.10 27 factorial in 32 units (14 replicate) 


Defining contrasts: ABCDE, ABCFG, DEFG 
Blocks of 4 units 


Estimable 2-factors: AB, AC, BC, and DF = EG аге lost by confounding. Ail 
other 2-factors аге estimable, except that DE = FG and DG = ЕР, so that members 
of these alias pairs cannot be separated. 


Blocks (1) (2) (3) (0 ©) © M ® 


(1) de ab сід ас bdg be adg 
defg fg cdf cef Бај bef ай aef 
abcdf abcdg сед abde beg acde aeg befg 
abceg abcef abdefg abfg acdefg acfg bedefg bede 


AB, AC, BC, DF = EG, ADG, BDG, CDG confounded. 


Effects d.f. 
Block E 
Main 7 
2-factor 14 


Higher order 3 


Total 31 
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Plan 6A.11 27 factorial in 32 units (14 replicate) 
Defining contrasts: ABCDE, ABCFG, DEFG 
Blocks of 8 units 


Estimable 2-factors: All except DF = EG (confounded with blocks) However, 
DE = FG and DG = EF are alias pairs which cannot be separated. 


Blocks (1) (2) (3) (4) 


(1) bdg ab de Effects d.f. 
bc bef ас Л Block 3 
adf сеј һај adg Main 7 
aeg abfg beg aef 2-factor 17 
defg асјд cdf bede Higher order 4 
abcdf  abde сед bcfg — 
abceg acde acdefg abcdg Total 31 


bedefg cdg abdefg abcef 
DF = EG, ADE, AEF confounded. 


Blocks of 16 units Blocks of 82 units 


Estimable 2-factors: All, except that DE Estimable 2-factors: Same as in blocks 
= FG, DG = ЕР, ала DF = EGarealias of 16 units. 
pairs. 


Combine blocks 1 and 2; and blocks 3 Combine blocks 1-4. 
and 4. AEF confounded. 


Effects df, Effects d.f. 
Block 1 Main 7 
Main 7 2-factor 18 
2-factor 18 Higher order 6 


Higher order 5 = 


= Total 31 
Total 31 


Plan 6A.12 


PLANS 


27 factorial in 64 units (14 replicate) 


Defining contrast: ABCDEFG 


Blocks of 4 units 


283 


Estimable 2-factors: All except AB, AC, BC, EF, EG, and FG (confounded with 


blocks). 
Blocks 


a) (2) (3) (4) (5) © 0 (8) 
(1) ab ac be ae be ce abce 
ака са bd ad bede асіе abde de 
defg аһаејд acdefg bedefg айд bdfg cdfg abcdfg 
abcefg «0 М0 ајд befg асјд а/ fg 
(9) (10) (а) (2 (13) (14) (15) (16) 
af bf of abf еј abef асеў beef 
bcdf ас] аш df abcdef cdef bdef adef 
adeg bdeg cdeg | abcdeg dg abdg acdg bcdg 
bceg — аса abeg ед abeg cg bg ag 


АВ, AC, BC, EF, EG, FG, ADE, ADF, ADG, ВРЕ, BDF, 


BDG, CDE, CDF, CDG confounded. 


Effects d.f. 
Block 15 
Main 7 
2-factor 15 


Higher order 26 


Total 63 
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Plan 6A.13 27 factorial in 64 units (14 replicate) 
Defining contrast: ABCDEFG 
Blocks of 8 units 


Estimable 2-factors: All. 


Estimable 3-factors: All except ABC, ADE, AFG, BDF, BEG, CDG, CEF (con- 


founded with blocks). 


Blocks (1) (2) (3) (4) (5) (6) (7) (8) 
(1) be ас ab ag af ae ad 
abdg de df dg bd be bf bg 
abef fg eg ef ce cd cg cf 
acdf abdf ае аса abcf аж) ака — abce 
aceg abeg аб) ас/0 adef adeg айд — aefg 
bede ай сід — bedf befg bdfg bdeg —bdef 
bcfg acef beef bceg cdfg cefg cdef — cdeg 
ејд. bedefg acdefg abdefg abedeg abcdef abcefg abcdfg 
Effects d.f. 
Block 7 
Main 7 
2-factor 21 
3-factor 28 
Total 63 


Blocks of 16 units 


Estimable 2-factors: All. 


Estimable 3-factors: All except ABC, 


ADE, AFG (confounded). 


Combine blocks 1 and 2; blocks 3 and 4; 


blocks 5 and 6; and blocks 7 and 8. 


Effects 
Block 

Main 
2-factor 
Higher order 


Total 


d.f. 


3 
ft 
21 


63 


Blocks of 32 units 
Estimable 2-factors: АП. 


Estimable 3-factors: АП except ABC 
(confounded). 


Combine blocks 1-4; and blocks 5-8. 


Effects d.f. 
Block 1 
Main T 
2-factor 21 
3-factor 34 

Total 63 


Blocks of 64 units 
Estimable 2-factors: All. 
Estimable 3-factors: АП, 
Combine blocks 1-8. 


Effects 
Main 
2-factor 
3-factor 


d.f. 

7 
21 
35 


Total 63 
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Plan 6А.14 28 factorial іп 16 units (М replicate) 


Defining contrasts: ABCD, ABEF, ABGH, ACEH, ACFG, ADEG, ADFH, BCEG, 
BCFH, BDEH, BDFG, CDEF, CDGH, EFGH, ABCDEFGH. 
Blocks of 4 units 


Estimable 2-factors: Only the 4 alias sets AE = BF = CH = DG; AF = ВЕ = 
CG = DH; AG = ВН = CF = DE; АН = ВС = СЕ = DF. Except in special 
circumstances, only main effects are estimable. 


Blocks (1) (2)-. (8). (4) Effects d.f. 
SS a oe Block 3 
(1) abef adeg aceh Main 8 
abcd abgh adfh acfg 2-factor 4 
efgh cdef beeg bdeh — 
abcdefgh cdgh befh bdfg Total 15 


AB, AC, AD confounded. 


Blocks of 8 units Blocks of 16 units 


Estimable 2-factors: As in blocks of 4 Estimable 2-factors: As in blocks of 8 
units, plus the alias sets AC = BD = ЕН units, plus the alias set AB = CD = ЕР 
= FG; AD = BC = EG = FH. = GH. 


Combine blocks 1 and 2; and blocks 3 Combine blocks 1-4. 
and 4. АВ confounded. 


Effects d.f. Effects d.f. 
Block 1 Main 8 
Main 8 2-factor 7 
2-factor 6 =i 

ж Total 15 


Total 15 
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Pian 6А.15 28 factorial in 32 units (15 replicate) 
Defining contrasts: BCDH, BDFG, CFGH, ABCEF, ABEGH, ACDEG, ADEFH 
Blocks of 4 units 


Estimable 2-factors: All interactions of A and of Е. All other 2-factors are lost by 
confounding. 


Blocks (1) (2) 9 (4) ©) (6) (7) (9 
(1) abd ША cdf ајд аһ beg fh 
ae bde abcf абв ејд ceh adfh асай 


abcdfgh cfgh ^ bcef begh bedh — bdfg аһ cdeg 
bedefgh acefgh adegh acdef abcdeh abdefg abceg abefh 


BD, BF, BH, CF, DF, DH, FH and their aliases confounded. 


Effects d.f. 
Block 7 
Main 8 
2-factor 13 
Higher order 3 
Total 31 
Blocks of 8 units Blocks of 16 units 


Estimable 2-factors: All interactions of Estimable 2-factors: As in blocks of 8 
А and of E, and the alias pairs BC = units, plus the alias sets CG = FH, BD 
DH, BF = DG, BG = DF, BH = CD. = CH = FG, 


Combine blocks 1 and 2; blocks 3 and 4; Combine blocks 1-4; and blocks 5-8. 
blocks 5 and 6; and blocks 7 and 8. BD, СЕ confounded. 
CF, FH confounded. 


Effects d.f. Effects d.f. 
Block 3 Block 1 
Main 8 Main 8 
2-factor 17 2-factor 19 
Higher order 3 Higher order 3 


Total 31 Total 31 
Blocks of 32 units 


Estimable 2-factors: All interactions of A and of Е, plus the alias sets BC = DH, 
BF = DG, BG = DF, BH = CD, СС = FH, CF = GH, BD = CH = FG. 


Combine blocks 1-8, 


Effects d.f. 
Main 8 
2-factor 20 
Higher order 3 


Total 31 


PLANS 


Plan 6A.16 
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28 factorial in 64 units (14 replicate) 


Defining contrasts: ABCEG, ABDFH, CDEFGH 


Blocks of 4 units 
Estimable 2-factors: All except AF, AH, BC, BG, CG, DE, FH (confounded with 
blocks). 
Blocks (1) (2) (8) а 6) (6) (7) (8) 
(1) adg ach beh eg fh bef acf 
adefh abce М4 abdf bed ade abdh ^ bgh 
bcdeg — efgh cdef cdgh adfgh abeg cdfg | cdeh 
abcfgh bedfh abdegh асејд ^ abcefh bedefgh acegh — abdefg 
(9) (10) (1) (12) (13) (14) (15) (16) 
ab ce afg df acd cg dh agh 
cfgh bdg bch aeh abeg bde aef bef 
acdeg acdfh degh bcefg cefh аһ bcegh ејд 
bdefh abefgh abcdef abcdgh bdfgh ^ acdefgh abcdfg abcdeh 


AF, AH, BC, BG, CG, DE, FH, ACD, ADG, BEF, BEH, CDF, CEF, 


DFG, EFG confounded. 


Effects d.f. 
Block 15 
Main 8 
2-factor 21 
Higher order 19 

Total 63 


Blocks of 8 units 


Estimable 2-factors: All except BC and 
FH (confounded with blocks). 


Combine blocks 1 and 2; blocks 3 and 4; 
blocks 5 and 6; blocks 7 and 8; blocks 9 
and 10; blocks 11 and 12; blocks 13 and 
14; and blocks 15 and 16. BC, FH, 
ACD, BEF, BEH, CEF, DFG con- 
founded. 


Effects d.f. 
Block T 
Main 8 
2-factor 26 
Higher order 22 

Total 63 


Blocks of 16 units 
Estimable 2-factors: All. 


Combine blocks 1-4; blocks 5-8; blocks 
9-12; and blocks 13-16. ACD, BEF, 
DFG confounded. 


Effects d.f. 
Block 3 
Main 8 
2-factor 28 
Higher order 24 

Total 63 


288 FACTORIAL EXPERIMENTS IN FRACTIONAL REPLICATION 


Plan 6А.16 (Continued) 


Estimable 2-factors: All. 
Combine blocks 1-8; and blocks 9-16. 


Blocks of 82 units 


ACD confounded. 


Blocks of 64 units 


Estimable 2-factors: All. 
Combine blocks 1-16. 


Effects 4.1. Effects d.f. 
Block 1 Main 8 
Main 8 2-factor 28 
2-factor 28 Higher order 27 
Higher order 26 = 
= Total 63 
Total 63 
Plan'6A.17 25 factorial in 128 units (14 replicate) 
Defining contrast: ABCDEFGH 
For blocks of 8 units, see end of plan. 
Blocks of 16 units 
Estimable 2-factors: All. 
Estimable 3-factors: All, 
Blocks (1) (2) (3) (4) (5) (6) (7) (8) 
(1) аһ fh ac af ch ah cf 
abcd bdfh ^ abcdfh bd bcdf аһ bedh — abáf 
adeg abefgh abefgh abeg defg begh degh befg 
bceg cdefgh  bcefgh cdeg abcefg  acdegh  abcegh acdefg 
adfh ab ad abfh dh bf df bh 
befh са be cdfh аһ acdf abof асаь 
efgh aceg eg acefgh aegh сејд aefg cegh 
abcdefgh bdeg ађсіед bdefgh bedegh abdefg bcdefg abdegh 
abgh ef abfg eh bfgh ае bg aefh 
aceh adfg асе} adgh — cefh dg ce dfgh 
bdeh befg bdef begh abdefh abeg ^ abde abcfgh 
cdgh abcdef cdfg ^ abcdeh acdfgh bede acdg — bcdefh 
abef gh abeh — fg be ајд ^ befh ag 
acfg adeh ^ acgh adef сд defh cfgh de 
bdfg bceh — bdgh beef афід abcefh abdfgh abce 
cdef abedgh cdeh abcdfg acde bedfgh acdefh bedg 
ABCD, АВЕР, ACFG, ADEG, BCEG, BDFG, CDEF confounded. 
Effects 44. 
Block 7 
Main 8 
2-factor 28 
3-factor 56 
Higher order 28 
Total 127 


PLANS 


Plan 6A.17 (Continued) 
Blocks of 32 units 

Estimable 2-factors: All. 

Estimable 3-factors: All. 


Combine blocks 1 and 2; blocks 3 and 4; 
blocks 5 and 6; and blocks 7 and 8. 
ABCD, ABEF, CDEF confounded. 


Effects df. 
Block 3 
Main 8 
2-factor 28 
3-factor 56 


Higher order 32 


Total 127 


Blocks of 128 units 
Estimable 2-factors: All. 
Estimable 3-factors: All. 
Combine blocks 1-8. 


Effects d.f. 
Main 8 
2-factor 28 
3-factor 56 


Higher order 35 


Total 


Blocks of 64 units 
Estimable 2-factors: All. 
Estimable 3-factors: All. 


Combine blocks 1-4; and blocks 5-8. 
ABCD confounded. 


Effects d.f. 
Block 1 
Main 8 
2-factor 28 
3-factor 56 
Higher order 34 

Total 127 

Blocks of 8 units 


Estimable 2-factors. All except ЕС and 
FH (confounded). Start with the plan 
for blocks of 32 units. The first 4 rows 
of blocks 1 and 2 form the first block of 
8 units: i.e., this block contains 1, abcd, 
adeg, bceg, acfh, bdfh, abefgh, cdefgh. Sim- 
ilarly, rows 5-8 of blocks 1 and 2 give 
the second block, rows 9-12 the third and 
rows 13-16 the fourth. The remaining 
12 blocks are formed likewise from blocks 
3 and 4; blocks 5 and 6; and blocks 7 
and 8. 


Effects df. 
Block 15 
Main 8 
2-factor 26 


Higher order 78 


Total 127 
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Plan 6A.18 3* factorial in 27 units (14 replicate) 
Defining contrasts: 2 d.f. from ABCD, equivalent to putting D = ABC(Y) 
Blocks of 9 units 


Estimable 2-factors: 16 of the 24 d.f. are clear. CD(J) is lost by confounding. Also 
AB(J) = CD(J); АС(1) = BD(1); AD(1) = ВС(1). 


Blocks (1) (2) (3) Effects d.f. 
——— Block 2 
0000 0021 0012 Main 8 
0122 0110 0101 2-factor 16 
0211 0202 0220 = 
1022 1010 1001 Total 26 


1111 1102 1120 


1200 1221 1212 If all interactions of D are negligible, the 


2011 2002 2020 analysis may be written: 

2100 2121 2112 Effects d.f. 
2222 2210 2201 Block 2 
Main 8 
AB, AC, BC 12 

Error (from inter- 
actions of D) 4 
Total 26 

Blocks of 27 units 


Estimable 2-factors: As in blocks of 9 units, plus CD(Z). 


Combine blocks 1, 2, and 3. See section 6A.32 for analysis of variance. 


Plan 64.19 


Combine blocks 1-3; blocks 4-6; blocks 


7-9. 


Estimable 2-factors: All except AE(J), which is confounded with blocks, 


Blocks 


PLANS 


3° factorial in 81 units (14 replicate) 
Defining contrasts: 2 d.f. from ABCDE 


(1) 


(2) 


Blocks of 9 units 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


ab 


cde 


Blocks of 27 units 
Estimable 2-factors: All. 


Effects 
Block 

Main 
2-factor 
Higher order 


Total 


2-factor 
Higher order 


d.f. 


2 
10 
40 
28 


80 


120 
212 
001 
200 
022 
111 
010 
102 
221 


Total 


Combine blocks 1-9. 


(9) 
cde 


210 
002 
121 
020 
112 
201 
100 
222 
011 


Blocks of 81 units 
Estimable 2-factors: All. 


Effects 
Main 

2-factor 
Higher order 


Total 
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Plan 6A.20 4 X 2 factorial in 32 units (24 replicate) 
Defining contrast: A” BCDE 
Blocks of 8 units 
Estimable 2-factors: All except DE (confounded with blocks). 
Blocks а) (2) (3) (4) 
ab | cde cde cde cde 
00 | 100 010 111 001 Effects d.f. 
01 | 011 101 000 110 Block 3 
10 | 011 101 000 110 Main 7 
11 | 100 010 111 001 2-factor 17 
20 | 111 001 100 010 Higher order 4 
21 | 000 110 011 101 — 
30 | 000 110 011 101 Total 31 
31 | 111 001 100 010 
Blocks of 16 units Blocks of 32 units 
Estimable 2-factors: All. Estimable 2-factors: All. 
Combine blocks 1, 2; and blocks 3, 4. Combine blocks 1, 2, 3, and 4. 
Effects d.f. Effects d.f. 
Block 1 Main 7 
Main 1 2-factor 18 
2-factor 18 Higher order 6 


Higher order 5 — 


-- Total 31 
Total 31 


CHAPTER 12 


Factorial Experiments with Main Effects 
Confounded: Split-plot Designs 


7.1 Тһе Simple Split-plot Design 


7.11 Description. In field experiments an extra factor is sometimes 
introduced into an experiment by dividing each plot into a number of 
parts. For example, if the experiment is planned originally to test a 
factor A with five levels, the division of each plot into halves permits the 
inclusion of an extra factor B at two levels. Within each plot the two 
levels of B are allotted at random to the two sub-plots. If the whole 
plots are in a randomized block design, the plan (after randomization) 
might appear as shown in table 7.1. It is worth noting the difference be- 


TABLE 7.1 EXAMPLE OF A SPLIT-PLOT DESIGN 
Rep. 1 Rep. 2 Rep. 3 


а а а а а а а a а; о; а з 00 % u 
boj Ы [bo| [bo] | bo bi] [bi] [bo ы bo bi| | bo ы ы bi 
p bo} [bi] [bı] |b: bo| [bo] | br ы| ы ы ы ы ы bo 


tween this arrangement and ordinary randomized blocks. In the latter 
the ten treatment combinations are assigned to the ten sub-plots in a 
replication completely at random. Here we have a more orderly assign- 
ment in which the two treatment combinations that have any given level 
of A always appear in the same whole-plot. 

This type of arrangement is common in industrial experimentation, 
although the connection may not at first be realized. Frequently, one 
Series of treatments requires rather a large bulk of experimental ma- 
terial, while another series can be compared with much smaller amounts. 
For instance, the comparison of different types of furnace for the prep- 
aration of an alloy would use much greater amounts of alloy than the 
comparison of different types of mould into which the alloy might be 
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poured. In an experiment in which both factors аге to be tested, the 
natural procedure is to take the material prepared in any furnace, and 
pour some of it into each mould. That is, material prepared in one 
furnace at one time provides a complete replication for the comparisons 
among moulds, just as the plot containing any level of A provides a com- 
plete replication of the factor B. Another instance is the comparison of 
different machines for milking dairy cows. Each machine would neces- 
sitate rather substantial amounts of milk, whereas other comparisons, 
for example on the best method of pasteurizing or of cooling, could be 
conducted with a much smaller amount of milk per treatment. The 
produce from any machine could be subdivided for these subsequent 
tests. 

We may describe this design in another way that brings out more 
clearly its relation to the confounded factorial designs discussed in chap- 
ter 6. If the sub-units are regarded as the experimental units, it is seen 
that the treatments ao, aj, . . . , a4 are applied to groups or blocks of two 
units. Differences among these blocks are confounded with differences 
among the levels of A; i.e., the main effects of A are confounded. Ac- 
cordingly, the split-plot design is sometimes considered as one in which 
certain main effects are confounded, as contrasted with the designs in 
chapter 6, where the confounding is restricted to interactions. 


7.12 Nature of the Experimental Error. In the statistical analysis, 
account must be taken of the fact that the observations from different 
sub-units in the same unit may be correlated. In field experiments this 
correlation is just a reflection of the fact that neighboring pieces of land 
tend to be similar in fertility and in other agronomic properties. In in- 
dustrial experiments the same correlation may be present, because any 
factor that affects the whole batch of alloy prepared in one furnace at 
one time will tend to create similarity among smaller batches poured 
from it. 

The mathematical analysis used to examine the effects of this correla- 
tion will be illustrated for the experiment in table 7.1. Let $ refer to the 
replication, j to the level of A, and k to that of В: for sub-units in the 
same unit, the 7 and j subscripts will be the same. The assumption is 
made that a correlation р exists between the experimental errors е; and 
буа for any two sub-units in the same unit. Sub-units in different units 
are assumed to be uncorrelated. Mathematically, we have 


Е(едем) = ра;  E(eijkestu) = 0 


We now consider the effects of this model on the most important treat- 
ment comparisons. The main effects of A are calculated entirely from 
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unit totals ог means. With two sub-units per unit, the error variance of 
a unit total is 


Е(ел + ед)? = (ел?) + (ер? + 2Е(е;зле:у2) = o? + o? + 2р0? 
= 20?(1 + p) 


The factor 2 may be regarded as representing the effect of adding over 
2 sub-units. Consequently, for the main effects of A, the appropriate 
error variance per sub-unit is c?(1 + р). If there are В sub-units per 
unit, the corresponding variance works out at o7[1 + (8 — 1)p]. 

The main effects of B, on the other hand, are derived from the dif- 
ference between the two sub-units in a unit. For this we have 


E(ei — е)? = 207(1 — p) 


Thus the effective variance per sub-unit applicable to the main effects 
of B is о?(1 — p). This expression remains unchanged when there are 
8 sub-units per unit. This variance also applies to any component of the 
АВ interaction, since such components involve comparisons of (b; — bo) 
at different levels of A. 

For other treatment comparisons, the basic error variance may be 
different from either of the two expressions above. Consider, for in- 
stance, (agbo — a,b), а comparison of аз with a; at the zero level of В. 
In any replication the 2 sub-units involved are in different units, and are 
therefore independent. The variance of their difference is therefore 
207, and the basic variance per sub-unit is c?. However, the appropriate 
variance for all other comparisons of this type can be derived from the 
Базе variances c?[1 + (В — 1)p] and c?(1 — p). As will be seen, the 
analysis of variance gives unbiased estimates of these two variances, 
and from these, unbiased estimates of any particular variance can be 
obtained. 

In practice, p is nearly always positive. The result is that the main 
effects of A (the factor applied to the units) are less precisely estimated 
than those of B or than the AB interaction. 

The analysis of variance is fairly easy. For the example in table 7.1 we 
first compute the 15 plot totals. Their sum of squares of deviations is 
partitioned in the usual way into 2 d.f. for replications, 4 for the main 
effects of A, and 8 for the experimental error applicable to a whole-plot. 
The mean square for the latter is an unbiased estimate of o°(1 + p). 
All computations are divided by 2 to convert them to a sub-unit basis. 

Next take the difference (bı — bo) on every unit. The 15 differences 
provide 1 d.f. which represents the main effect of В, 4 d.f. which represent 
the AB interactions, and the remaining 10 d.f. whose mean square gives 
an unbiased estimate of the sub-plot error variance o°(1 — p). All sums 
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of squares are again divided by 2. Тһе complete separation of degrees 
of freedom is shown in table 7.2. 


TABLE 7.2 ANALYSIS OF VARIANCE FOR THE SPLIT-PLOT EXPERIMENT IN 


TABLE 7.1 

d.f. 

Whole plots 
Replications 2 
A 4 
Whole-plot error 8 
Total 14 

Sub-plots 

B 1 
AB 4 
Sub-plot error 10 
Grand total 29 


The point to be noted is that the whole-plot error is computed en- 
tirely from plot totals, and the sub-plot error entirely from the differences 
between sub-plots in the same plot. It will be seen that the grand total 
of the degrees of freedom is 29. As might be expected, the corresponding 
sum of squares is the sum of squares of deviations of the 30 observations 
from their mean. A computation of this quantity provides a check on all 
sums of squares. In practice, particularly with more than 2 sub-unit 
treatments, we usually find the sub-plot error by subtraction. 


7.13 Comparison with Randomized Blocks. Тһе experiment in table 
7.1 might be arranged in ordinary randomized blocks, with 3 blocks of 
10 treatments each. In a consideration of the relative merits of the two 
arrangements, the following points are relevant. 

1. With the split-plot design, usually the B and the AB effects are 
estimated more precisely than the A effects. Moreover, the number of 
degrees of freedom available for the experimental error m.s. is smaller 
for whole-unit comparisons than for sub-unit comparisons. 

2. It can be shown that the average experimental error over all treat- 
ment comparisons is the same for both designs. Consequently there is no 
net gain in precision resulting from the use of the split-plot design; the 
increased precision on B and AB is obtained by the sacrifice of precision 
on A. For tests of significance or the construction of confidence limits 
the randomized block design holds a slight advantage on the average 
since it provides more degrees of freedom for the estimate of the single 
error variance that it requires. For instance, the experiment cited has 
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8 d.f. for the whole-unit error and 10 d.f. for the sub-unit error, whereas 
randomized blocks would provide 18 d.f. 

3. As we have indicated, the chief practical advantage of the split- 
plot arrangement is that it enables factors that require relatively large 
amounts of material and factors that require only small amounts to be 
combined in the same experiment. If the experiment is planned to in- 
vestigate the first type of factor, so that large amounts of material are 
going to be used anyway, factors of the second type can often be in- 
cluded at very little extra cost, and some additional information obtained 
very cheaply. 

To summarize, the split-plot design is advantageous if the B and AB 
effects are of greater interest than the A effects, or if the A effects cannot 
be tested on small amounts of material. 

Two disadvantages have been mentioned by experimenters. Some- 
times the whole-unit error is much larger than the sub-unit error. It 
may happen that the effects of A, though large and exciting, are not 
significant, whereas those of B, which are too small to be of practical 
interest, are statistically significant. The experimenter tends to be 
uncomfortable in reporting results of this type. Secondly, the fact that 
different treatment comparisons have different basic error variances 
makes the analysis more complex than with randomized blocks, espe- 
cially if some unusual type of comparison is being made. 

When the number of replications and the experimental conditions are 
suitable, the whole units may be arranged ina latin square. A split-plot 
latin square, which eliminates the error variation arising from two types 
of grouping, may be preferable to randomized blocks. Summarizing 22 
field experiments in latin squares where the plots were split into halves, 
Yates (7.1) found a substantial net increase in precision over ran- 
domized blocks. The superiority of the latin square was so pronounced 
that even the whole-plot comparisons would have been less precisely de- 
termined in randomized block designs. Factorial combinations that lend 
themselves to the use of split-plot latin squares are the 5 X 2, 5 X 3, 
5 Х4,6 Х2,6 Х8,7 Х2,7 Х8,8 Х2. 


7.14 Randomization. The treatments applied to the units are тап- 
domized according to the instructions for the design (e.g. randomized 
blocks, latin square, ete.) in which the units are arranged. The treat- 
ments applied to the sub-units are allotted at random within each unit. 
A separate randomization is carried out for each unit. 


7.15 Statistical Analysis. If the factor A (applied to the units) con- 
tains а levels, and the factor В (applied to the sub-units) contains 0 
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levels, the subdivision of degrees of freedom in the analysis of variance is 
shown in table 7.3. Тһе details of the analysis are illustrated in section 
7.17: 


TABLE 7.3 PARTITION OF DEGREES OF FREEDOM FOR А SPLIT-PLOT DESIGN 


Units arranged in randomized blocks Units arranged in a latin square 


(r replicates) (г = a replicates) 
Units d.f. Units d.f. 
Blocks (т — 1) Rows (а — 1) 
A (а — 1) Columns (a — 1) 
Error (а) (а — 1)(r — 1) А (« — 1) 
------ Еггог (а) (а — 1)(« — 2) 
Total (ra — 1) = 
То{а1 (a? — 1) 
Sub-units Sub-units 
B (8-1) В (8 - 1) 
АВ («-1)8-1) АВ (а — 1)(8 — 1) 
Error (b) a(r — 1)(8 — 1) Error (Б). ala — 1)(8 — 1) 
Total ra(8 — 1) Total o*(8 — 1) 


7.16 Standard Errors. Let E, and E, be the error mean squares for 
error (a) and error (b), respectively, on a sub-unit basis. For the treat- 
ment means, also expressed on a sub-unit basis, the standard errors 
shown in table 7.4 apply. The final comparison in table 7.4 contains 


TABLE 7.4 STANDARD ERRORS FOR THE SPLIT-PLOT DESIGN 


Treatment comparison з.е. 
Difference between two A means: e.g., [(a1) - (а)! Y 2E,/rB 
Difference between two B means: ев. [(b1) — (591 V 2E/ra 


Difference between two B means at the same level of А: 
e.g, (аһ) - (ао) МКЕУЛІ 


Difference between two A means at the same level of B or 
at different levels of B: 


өк» [(a1b1) — (aob1)] or [(a1b1) — (ад! М2[(8 — 1) Еь + Eal/rB 


both the main effect of A and the AB interaction; consequently the ap- 
propriate error is a weighted mean of Е, апа Ey. This error also applies 
to the difference between two A means which have different levels of B. 
In such cases the ratio of the treatment difference to its standard error 
does not follow the distribution. For practical purposes the approx- 
imate rule of section 4.14 may be used, though this method gives slightly 
too few significant results. Let la; % be the significance levels of { corre- 
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sponding to the degrees of freedom in E, and Ey, respectively. The 
significance level of t is taken as 
(8 — )Еыь + Eata 

(8 — ПЕЬ- Е, 
For an application, see the next section. 


7.17 Numerical Example. In an experiment on the preparation of 
chocolate cakes, conducted at Iowa State College (7.2), 3 recipes for 
preparing the batter were compared. Recipes I and II differed in that 
the chocolate was added at 40°C. and 60°C., respectively, while recipe 
III contained extra sugar. In addition, 6 different baking temperatures 
were tested: these ranged in 10? steps from 175? to 225?. Each time that 
а mix was made by any recipe, enough batter was prepared for 6 cakes, 
each of which was baked at a different temperature. Thus the recipes 
are the “whole-unit” treatments, while the baking temperatures are the 
“sub-unit” treatments. There were 15 replications, and it will be as- 
sumed that these were conducted serially according to a randomized 
blocks scheme: that is, one replication was completed before starting 
the next, so that differences among replicates represent time differences. 
(The notes suggest that this was done, though they are not quite ex- 
plicit.) 

А number of measurements were made on the cakes. The measure- 
ment presented here is the breaking angle. One half of a slab of cake is 
held fixed, while the other half is pivoted about the middle until breakage 
occurs. "The angle through which the moving half has revolved is read 
on а circular scale. Since breakage is gradual, the reading tends to have 
а subjective element. The data are shown in table 7.5. 

It is customary to compute the analysis of variance on a sub-unit 
basis. То avoid confusion, this should be clearly stated in the analysis 
of variance table itself. The calculations may be presented in three 
steps. 

Step 1. Analyze the whole-unit totals by the method appropriate to 
the design in which they are arranged. 


2 

Correction factor: eo, = 278,596 
2 Uer 2 

Total: (209 + Cort + (155)" _ 978,596 = 11,538 

2 2 21 2 

Replications: ere — 278,596 = 10,204 
2 9848 2 2844 2 

Recipes: өну сма + oy — 278,596 = 135 


Error (a), by subtraction: 11,538 — 10,204 — 135 = 1,199 
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TABLE 7.5 BREAKING ANGLES (DEGREES) 


Temperature 
Unit 
Rep. 175° 185° 195° 205° 215° 225° totals 
1 42 46 47 39 53 42 269 
2 47 29 35 47 57 45 260 
3 32 32 37 43 45 45 234 
4 26 32 35 24 39 26 182 
5 28 30 31 37 41 47 214 
6 24 22 22 29 35 26 158 
7 26 23 25 27 33 35 169 
Recipe I 8 24 33 23 32 31 34 177 
9 24 27 28 33 34 23 169 
10 24 33 27 31 30 33 178 
11 33 39 33 28 33 30 196 
12 28 31 27 39 35 43 203 
13 29 28 31 29 37 33 187 
14 24 40 29 40 40 31 204 
15 26 28 32 25 37 33 181 T 
— — ер. 
Totals 437 473 462 53 580 526 2981 io 
- 8 
1 39 46 51 49 55 42 282 820 
2 35 46 47 39 52 61 280 665 
3 34 30 42 35 42 218 599 
4 25 26 28 46 37 37 199 583 
5 31 30 29 35 40 36 201 517 
6 24 29 29 29 24 35 170 492 
$ 2%-2 25 ' 26 7% 150.98 164 493 
Recipe II 8 26 23 24 31 27 37 168 492 
9 27 26 32 28 32 33 178 519 
10 21 24 24 27 37 30 163 538 
11 20 27 33 31 28 33 172 557 
12 23 28 31 34 31 29 176 574 
13 32 35 30 27 35 30 189 502 
14 23 25 22 19 21 35 145 479 
15 21 21 28 26 27 20 143 ЭБ 
= -- -- 86 
Totals 403 441 476 482 517 529 2848 
1 46 44 45 46 48 63 292 
2 43 43 43 46 47 58 280 
Б] 33 24 40 37 41 38 213 
4 38 41 38 30 36 35 218 
5 21 25 31 35 33 23 168 
6 24 33 30 30 37 35 189 
k 7 20 21 31 24 30 33 159 
Recipe Ш 8 24 23 21 24 21 35 148 
9 24 18 21 26 28 28 145 
10 26 28 27 27 35 35 178 
11 28 25 26 25 38 28 170 
12 24 30 28 35 33 28 178 
13 28 29 43 28 33 37 198 
14 19 22 27 25 25 35 153 
15 21 28 25 25 31 25 155 
Totals 419 434 476 463 56 536 2844 
Temp. totals 1259 1348 144 1448 1613 1591 8673 
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Step 2. This concerns the sub-unit treatments. Their main effects are 
obtained directly. 
(1259)? + (1348)? + ---+ (1591)? 


Temperatures: 48 — 278,596 — 2100 


'The sum of squares for interactions between sub-unit and whole-unit 
treatments is found by subtraction. First calculate the total s.s. for the 
two-way table that shows both sets of treatments. 


Total treatments: 
(437)? + (478)? ++++-+ (516)? + (536)? 
15 


— 278,596 = 2441 
Then, 
Recipes X temperatures: 2441 — 135 — 2100 = 206 
Step 3. Compute the total s.s. among all sub-units. 
Total: (42)? + (47)? +--+ (85)? + (25)? — 278,596 = 18,143 
Я The sum of souares for error (5) is then found by subtraction in table 
6. 


TABLE 7.6 ANALYSIS OF VARIANCE OF BREAKING ANGLES (ON А SUB-UNIT 
BASIS) FOR THE EXPERIMENT ON CHOCOLATE CAKES 


Source of variation d.f. 8.8. m.8, F 
Replications 14 10,204 
Recipes 2 135 67.5 1.58 
Еттог (а) 28 1,199 42.8 
Temperatures 5 2,100 420.0 20.49 

Linear regression ii 1,967 1,967.0 95.95 

Deviations 4 133 33.2 1.62 
Recipes X temperatures 10 206 20.6 1.00 
Error (b) 210 4,299 20.5 

Total 269 18,143 


The mean square for recipes, while above that for error (a), does not 
approach the 5% level. Temperature effects are highly significant as 
compared with error (b). Since there is a fairly steady increase in the 
breaking angle with increased temperature, the sum of squares has been 
divided into the component due to a linear regression on temperature 
and that due to deviations from the straight line. The regression coeffi- 
cient amounts to an increase of 1.6° in the breaking angle for each 10° 
rise in baking temperature. The mean square for deviations is not sig- 
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nificant, though it is higher than expectation. There is no indication of 
any interaction. It will be observed that the error (a) m.s. is about twice 
as large as that for error (b). 

In table 7.7 are shown the treatment means, with the principal stand- 
ard errors as obtained from table 7.4. 


TABLE 7.7 BREAKING ANGLE MEANS (DEGREES) 


Recipe 
Recipe 175* 185° 195° 205° 215° 225° means 


I 29.1 31.5 30.8 33.5 38.7 35.1 33.1 

II 26.9 29.4 31.7 32.1 34.5 35.3 31.6 

ш 27.9 28.9 31.7 30.9 34.4 35.7 31.6 
Temp. means 28.0 30.0 31.4 32.2 35.9 35.4 32.1 


Standard error of difference between 
2(42.8) 
90 


‘Two temperature means: ^ |225 = 0.95 (210 d.f.) 


Two temperature means for one recipe: ve = 1.65 (210 d.f.) 


2 D . 
Two recipe means for a given temperature: ү ®® 44281 = 1.80 


The 5% levels of t are 2.05 and 1.97, respectively, for 28 and 210 d.f. 
Consequently, the 5% level for the last standard error above is 


(5)(20.5)(1.97) + (42.8)(2.05) _ 
(5) (20.5) + (42.8) B 


This value always lies between the two individual t-values. In practice 
it need rarely be calculated. 


Two recipe means: = 0.98 (28 d.f.) 


7.18 Missing Data. The formulae for inserting estimates of missing 
values have been developed by Anderson (7.3). Suppose first that the 
observation for a single sub-unit is missing, and that this sub-unit re- 
ceives the treatment combination (а). Let 


U — total for the unit containing the missing observations. 
(А;В;) = total of all sub-units that receive the treatment combination 


(аў). 
(А) = total of all observations that гесејуе the ith level of A. 
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Then the estimate to be inserted for the missing value is 
_ TU + B(A;B;) – (4) 
2-2 -4-1)8-1) 
Example. Suppose that the observation for recipe II, 195° temper- 
ature, in replication 2 is missing. (Its actual value is 47.) Then 
И = 280 — 47 = 233; (А;В;) = 476 — 47 = 429; 
(A;) = 2848 — 47 = 2801 
15(233) + 6(429) — 2801 3268 
(14)(5) 770 


- 47 


у 


By accident, the estimated value is the same аз the actual value. If 
several observations are missing, repeated use is made of the formula as 
described in section 4.25. 

In the analysis of variance 1 d.f. is subtracted from the error (b) d.f. 
for each missing observation. An unbiased estimate of E; is obtained, 
but the treatment sums of squares and that for error (a) are biased up- 
wards. If only a small fraction of the values is missing, it appears that 
these biases can be ignored; methods for obtaining unbiased estimates are 
given by Anderson (7.8). 

Standard errors. When there are missing observations, the formulae 
in table 7.8 are suggested for the differences between 2 means. 


TABLE 7.8 STANDARD ERRORS FOR SPLIT-PLOT EXPERIMENTS WITH 
MISSING DATA 


Treatment comparison в.е. 
Difference between two A means: V Еа + ЈЕЫ/тВ 
Difference between two B means: МЕ! + (f8/a)]/re 
Difference between two B means at the same 
level of A: У2Е41 + (/8/а)]/т 


Difference between two A means at the sam 


е 
level of В, or at different levels of В: V (2E4/78) + (2Es)[(8 — 1) + /8°]/тВ 


If only one value is missing in the experiment, and if a mean con- 
taining that value is compared with another mean, the factor f is 
1/2(r — 1)(8 — 1), and the formulae are exact, reference (7.3). 

With more than 1 missing observation, the value of f depends on the 
locations of the missing observations. The following approximation, 
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developed by С. 8. Watson, is correct for a number of cases but other- 
wise tends to be slightly too high. 


= 


k 
2(- d)(8 — k + c — 1) 
When counting the values of б, c, and d, as defined below, be sure to 


ignore all missing observations except those that occur in the 2 means 
that are being compared. 


k = number of missing observations. 

с = number of replications which contain 1 or more missing observa- 
tions. 

d = number of missing observations in the sub-unit treatment (а) 
that is most affected. 


For example, suppose that in the cake experiment the values for recipe 
I, 175° temperature, are missing in replications 1, 2, and 3, while those 
for recipe I, 185° temperature, are missing in replications 3 and 4, We 
are comparing the means for the 2 temperatures, taken over all recipes. 
Then k is 5, and c is 4. The sub-unit treatment with the most missing 
values is recipe I, 175° temperature, which has 3 missing, so that d is 3. 

The case where a complete unit is missing is discussed by Anderson 
(7.3). 


7.2 Repeated Subdivision 


7.21 Description. In order to include a new factor C at y levels, 
the sub-units may each be divided into sub-sub-units. There are three 
experimental error variances. Errors (а) and (b) have the same functions 
as described previously and are caleulated in the same manner, except 
that an extra divisor y is introduced into all sums of squares in order to 
present the analysis on a sub-sub-unit basis. Error (c), which will usually 
be the smallest of the three, applies to the C, AC, BC, and ABC effects. 
The process of subdivision may be carried as far as is convenient. 

The partition of degrees of freedom has already been given (table 7.3) 
for factors A and B. For C and its interactions. the partition is shown in 
table 7.9. 


TABLE 7.0 ANALYSIS OF VARIANCE FOR AND ITS INTERACTIONS 


Effect d.f. 

с (у= 1 

АС (а — 1)(у — 1) 
BC (8 – 1)(у – 1) 
АВС (а-06-1)-1) 


Error (с) а8(7 — (у — 1) 
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The computations in the analysis of variance are a straightforward 
extension of those for the simple split-plot design. The sum of squares 
for AC is obtained by calculating the total s.s. for the A by C two- 
way table, and subtracting the sums of squares due to A and C; and 
similarly for the other factorial effects in table 7.9. Finally, the sum 
of squares for error (c) is found by calculating the total s.s. among 
sub-sub-units, and subtracting the contributions from all other items in 
the analysis. Worked examples are given in references (7.4) and (7.5). 


7.22 Standard Errors. The formulae in table 7.4 for standard errors 
applicable to A and B effects remain valid apart from division by an ad- 
ditional factor ~/y. For the principal comparisons which involve C ef- 
fects, standard errors are shown in table 7.10. All treatment means and 
error mean squares are assumed to be on a sub-sub-unit basis. 


TABLE 7.10 STANDARD ERRORS FOR THE SPLIT-PLOT DESIGN WITH TWO 


SUBDIVISIONS 

"Treatment comparison в.е. 
(е) - (оо)! У 2Е,/тав 
[(атс!) — (аһ)! У 2Е,/тВ 
Фе) - (Бо)! V2E,/ra 
[(aibiey) — (a1519)] Y 2E./r 
(bier) - (oc))] or [(brex) — (boco)] V 2I(v — 1) Ec + Ebl/rey 
((@ibrex) — (a15oc1)] V2[(y — DE. + Es)/ry 
(азс!) — (аос)] оғ (алс) — (00с)] У 210 — 1) Ee + Ecl/rBv 
(ас) — (aobre)] У 28(у — ПЕ. + (8 — 1)Ёь + Eal/rBy 


7.3 Some Variants of the Split-plot Design 


7.31 Systematic Arrangement of the Treatments Applied to the Units. 
The device of subdividing each unit into a number of sub-units is very 
flexible, and can be used with any type of design in which the units are 
arranged. Some of the more useful variations that are possible are de- 
scribed in the remainder of this chapter. The first arises when the nature 
of the experiment makes it necessary or desirable to have the A treat- 
ments arranged in a systematic design. This arrangement has been 
used in experiments where the A treatments were varieties of wheat 
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known to mature at different dates, and the B treatments were prepara- 
tions applied to the seed before sowing. Since each variety was to be 
harvested when ripe, the field operations were made easier by planting 
the varieties in the order in which they would be ready for harvest. The 
field plan is illustrated in table 7.11, where а is the earliest variety and 
а5 the latest. 


TABLE 7.11 EXAMPLE OF SYSTEMATIC ARRANGEMENT OF TREATMENTS 
APPLIED TO UNITS 


41 а аз u a5 
pe мни LL Ax 
bof [bi fo: bo} [bo 


by| [bo] || fbi) fbr 


а а аз a as 


bo 
п 
by m bo 


bo 

а 
m bo bo m bo 
ш ты 


ыыы [bo] |ы 
ы-і АА - 1 шы 


The sub-unit treatments are randomized within each unit; the unit 
treatments, however, appear in separate strips. 

This arrangement provides no valid estimate of error for the A main 
‘effects, or for comparisons such as (aibo — аҙ%) that involve A effects. 
Thus the “‘whole-plot” analysis of variance is irrelevant. In the sub-plot 
analysis of variance, error (5) is still valid for testing the B effects and 
the AB interactions. One point should be watched. A comparison such 
as (a,b; — abo) is derived entirely from the first strip in the plan. Con- 
sequently, if the sub-plot error yariance differs from strip to strip, the 
use of the error (6) m.s. for testing this quantity is not justified; instead, 
it is necessary to compute a separate error using only data from the first 
strip. 

To summarize, systematic placement of the А treatments is advisable 
only where it is essential in order to conduct the experiment and where 
no test of the A main effects is required. 


7.32 Sub-unit Treatments in Strips. In a further variant the sub- 
unit treatments, instead of being randomized independently within each 


7.32 SUB-UNIT TREATMENTS IN STRIPS 307 


unit, are arranged in strips across each replication. For а 5 X 3 design 
the appropriate rearrangement (after randomization) might be as shown 
in table 7.12. 

This layout may be convenient for field experiments where it is neces- 
sary to test both factors on relatively large areas and to leave free access 


TABLE 7.12 DESIGN WITH SUB-UNIT TREATMENTS IN STRIPS 


Rep. I 
аз а аз а а 
by | 
bo | 
ы 
Analysis of variance 
Rep. П ағ. 
ay а ao аз аз Replications 2 
A 4 
by Error (a) 8 
B 2 
be Error (0) 4 
AB 8 
bo Error (c) 16 
Total 44 
Rep. Ш 
a аз ao аз а 
bg 
ы 
bo | 


at both ends. Similar conditions may apply in other types of experi- 
mentation. 

As with the ordinary split-plot design, the average precision over all 
treatment comparisons is the same as that of randomized blocks arranged 
on the sub-units. The present design sacrifices precision on the main ef- 
fects of A and B in order to provide higher precision on the interactions, 
which will generally be more accurately determined than in either ran- 
domized blocks or the simple split-plot design. Since in addition the 
numbers of degrees of freedom for estimating the errors of the A and B 
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effects are likely to be small, the design is not recommended unless prae- 
tical considerations necessitate its use or unless the interactions are the 
principal object of study. Both the A and B treatments should be ran- 
domized independently within each replication. 


TABLE 7.13 LATIN SQUARE DESIGN WITH SUB-UNIT TREATMENTS IN STRIPS 
a a в a4 a3 


bi 
bo 
bs 
a4 аз а аҙ а 
by ] | 
ME Fd [| к 
0 
Е 
h INE 724 1 Analysis of variance 
d.f. 
% а аз в а Rows 4 
Columns 4 
b A 4 
Error (a) 12 
pM 1 T 2 
Error (b) 8 
bo AB 8 
Error (с) 32 
a (i108 aud @ Total 74 
by 
е 
= 
b 
d | 
as S ME а а 
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In the statistical analysis, separate estimates of error are obtained for 
A, B, and AB. Both sets of main effects may be regarded as tested in 
randomized block designs, in which the sum of squares for replications 
(blocks) is the same for both designs. A worked example (on uniformity 
data) is given in reference (7.6). 

Formulae for the estimated standard errors of the differences between 
treatment means per sub-unit are given in table 7.14. Тһе mean squares 
Ea, Ey, and E, are assumed to be on a sub-unit basis. 


TABLE 7.14 STANDARD ERRORS WHEN SUB-UNIT TREATMENTS ARE IN STRIPS 


(а = number of levels of A; В = number of levels of В; г = number of replicates) 


Treatment comparison в.е. 
(ай) - (ад! М 2Е,/тВ 
1) - (59] V 2Ey/ra 


(аһ) — (aob)] VAB — ПЕ. + Ё„]/тВ8 
[(a1b1) — (a159)] М2[(« — ПЕ. + Е /та 


One of the factors may be arranged in a latin square. With a 5 X 3 
design in 5 replicates, a layout of this type is shown in table 7.13. 

The statistical analysis is straightforward. For the B treatments, the 
“blocks” of the randomized blocks design are the rows of the latin square. 
The formulae given above for the standard errors are applicable (with r 
equal to а). 


7.33 Sub-unit Treatments in a Latin Square. In certain cases the 
sub-unit treatments may be arranged in a latin square, with the prospect 
of a further increase in accuracy on sub-unit comparisons. There are 
two types of layout, of which the first will be described in this section; 
the second, which is used when there are only a few whole-unit treat- 
ments whose main effects are not required with high accuracy, is dis- 
cussed in section 8.6. 

In the first type the number of replications must be a multiple of the 
number of sub-unit treatments. For each whole-unit treatment, the 
sub-unit treatments are arranged in one or more separate latin squares. 
An example with 4 whole-unit treatments, 3 sub-unit treatments, and 3 
replications is shown in table 7.15. The latin squares are not very clearly 
displayed in the plan. If we examine the three units that receive аз, it is 
seen that each b appears once in the top sub-unit, once in the middle, 
and once at the bottom. Thus the rows of the latin square are the posi- 
tions of the sub-units within the unit. The only change from the ordi- 
nary split-plot arrangement is that differences among these rows are 
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eliminated from the sub-unit errors. If these differences are large, sub- 
unit comparisons are more precisely determined than in the ordinary de- 
sign. Whole-unit comparisons are unaffected. 

The designs are useful where a fairly consistent. gradient is expected 
within each unit. Examples have been quoted previously. In experi- 


TABLE 7.15 SPLIT-PLOT DESIGN WITH SUB-TREATMENTB IN A LATIN SQUARE 


а а а а 
В ы |bo| [bi 
КЕМЕ 
Вер. 1 ы bof fbi] [bo Analysis of variance 
В be ІМ ba Units d.f. 
“4 Replications 2 
A 3 
в а а чё Error (а) 6 
В [>| ET Total п 
П |) | Д be Sub-units 
til j| oo} fo Kon 5 
у |һ| |ы ы B 2 
АВ 6 
з ә ша а Error (b) 8 
В ЕГЕР Total 24 
fol 
ш n bi ü bo 
ы bo bo В 
18411 


ments on plant viruses, the sub-units may be the top, middle, and lower 
leaves of a plant, with a regular gradient of susceptibility down the 
plant. The sub-unit treatments may be operations performed in suc- 
cession by the same person, or drugs injected in succession to the same 
animal, where a time-trend is anticipated. 

The number of replicates may be any multiple of the number of sub- 
unit treatments. With 6 replicates of the design in table 7.15, the ex- 
perimental material is divided into 2 groups of 3 replicates each. If 
practicable, the division should be made in such a way that the within- 
unit gradient is constant in a given group. Since the rows are eliminated 
separately for each group, changes in the gradient from group to group 
do not increase the error, 


In the randomization the first step is to randomize the units according 
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to the instructions for the design in which they are arranged. For each 
whole-unit treatment a latin square of the size required to accommodate 
the sub-treatments is then randomized as described in section 4.33. А 
new randomization of the latin square is taken for each successive whole- 
unit treatment. The first column of each square is assigned to the first 
replication of the experiment, and so on. 

For extra replication the basic plan is repeated with a new randomiza- 
tion. 

Since the whole-unit analysis of variance is unaffected, it is necessary 
to show only the subdivision of the degrees of freedom among sub-units. 
Table 7.16 applies to the case in which the basic plan is repeated К times. 


TABLE 7.16 PARTITION OF DEGREES OF FREEDOM AMONG SUB-UNITS WHEN 
SUB-TREATMENTS FORM LATIN SQUARES 


d.f. 
Rows Ко(В — 1) 
Sub-unit treatments (8- ) 
Sub-unit X whole-unit treatments (a — 1)(8 — 1) 
Error (b) a(8 — 1)(r — 1 — k) 
Total ra(8 — 1) 


Notation: а = number of whole-unit treatments. 
8 = number of sub-unit treatments. 
r = number of replications = kp. 


The number of degrees of freedom for rows should be noted. Since 
each latin square supplies (8 — 1) degrees of freedom for rows, every 
whole-unit treatment contributes k(8 — 1) degrees of freedom, so that 
the total is ka(8 — 1) degrees of freedom. The sum of squares for rows 
is calculated by addition of the sums of squares for each of the ka latin 
squares. The sum of squares among the columns of the latin squares 
constitutes a whole-unit comparison and does not appear in the portion 
of the analysis shown above. 

Before using this design it is advisable to verify that the number of 
degrees of freedom for error (0) is adequate. In the example in table 7.15 
there are only 8 d.f., as against 16 with an ordinary split-plot arrange- 
ment. From section 2.31 it appears that the sub-unit error variance 
must be reduced by at least 15% in order to compensate for the decrease 
in degrees of freedom. 


7.34 Split-plot Technique with Confounded Designs. The treat- 
ments applied to the units may constitute a factorial system arranged in 
a design that confounds certain interactions completely or partially. 
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Since experimenters are sometimes uncertain how to compute the analysis 
of variance in such cases, the following notes indicate the method. 

1. The totals for each unit are analyzed by the instructions for the 
confounded design in which the units are arranged. 

2. In computing the sums of squares for the sub-unit treatments and 
for all interactions between sub-unit and whole-unit treatments, the 
confounding is ignored. Even if a whole-unit factorial effect is com- 
pletely confounded, its interactions with sub-unit treatments are un- 
confounded and may be tested by means of error (b). 

3. The sum of squares for error (b) is found as usual by subtracting the 
total s.s. on the whole units plus the total s.s. for sub-unit trentments 
and for interactions between sub-unit and whole-unit treatments from 
the total s.s. for sub-units. 

A difficulty arises occasionally in the presentation of tables. Suppose 
that a3 X 3 X 2 design is applied to the units in blocks of 6 units, with 
AB and ABC partially confounded, and that a factor D at 2 levels is ap- 
plied to the sub-units. In the A by B two-way table, the entries must be 
adjusted to eliminate block effects. The problem is how to present the 
ABD three-way table. The observed means (agbodo) and (agbod;) will 
average to the unadjusted mean for (aobo), which contains block effects. 
The simplest process is to calculate the difference between the adjusted 
and the unadjusted means of (aobo). This difference is then applied to 
(ао0о%) and (aobod;). Notice that this adjustment does not change the 
difference between these two entries and brings their mean to the correct 
value. 


7.35 Confounding of Comparisons among Sub-unit Treatments. 
When the sub-unit treatments are factorial, it may be advantageous to 
confound certain interactions among these factors with the units. The 
technique, first presented by Yates (7.7), is illustrated in table 7.17, 
which shows a single replication of the design. The factor A applied to 
the units has 2 levels, while the sub-unit treatments are the 8 combina- 
tions of factors B, C, and D at 2 levels each. The ordinary split-plot ar- 
rangement is shown in (i), while (ii) and (iii) are two alternatives for 
the new method. 

Consider first (i) and (1). In the ordinary arrangement the unit con- 
tains 8 sub-units, to which the 8 sub-treatments are allotted at random. 
With the new arrangement the sub-units are in groups of 4, chosen so 
that BCD is confounded with group totals. The advantage of the new 
layout is that the size of the whole-unit has been reduced from 8 to 4 sub- 
units. This will in general lead to more accurate estimates both of the 
A effects and of the sub-unit treatment s. The only exceptions are the 
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high-order interactions BCD and ABCD. In the ordinary arrangement 
these interactions are derived from comparisons among sub-units in the 
same unit, whereas in the new design they are estimated from com- 
parisons among the units, as the reader may verify. 

Arrangement (iii) goes a step further. The ag and a; treatments which 
have opposite sets of sub-unit treatments are placed in the same sub- 


TABLE 7.17 Conrounpina оғ BCD 1м A SPLIT-PLOT EXPERIMENT 


i. Ordinary arrangement її. New arrangement 
a 41 


а 
b Q0] (е 
bed e| d 
| [d cd | | bed 
Cil 


iii. New arrangement in sub-blocks 


Block Ia Block Ib 
а а do a 


Га | с b | (0 


ti 


& 


= 


= 


= 


block. The result is that ABCD is completely confounded between sub 
blocks, while BCD, like A, is orthogonal with sub-blocks. The aim is to 
sacrifice information on ABCD in the hope of obtaining more precise 
estimates of A and BCD. Consequently, alternative (iii) is usually 
preferable to (ii). 

With 5 replications of arrangement (iii), the partition of degrees of 
freedom is set out in table 7.18 in condensed form. There are 80 sub- 
units and 10 sub-blocks. 


TABLE 7.18 PARTITION OF DEGREES OF FREEDOM IN A 2 X 23 DESIGN WITH 
SPLIT-PLOT CONFOUNDING 


Units ағ Sub-units ағ. 
Between sub-blocks 9 B, C, D, BC, BD, CD 6 
A 1 АВ, AC, AD, ABC, ABD, ACD 6 
BCD 1 Error (b) 48 
Error (a) e РЁ 


Grand total between sub-units 79 
Total between units 19 
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All sums of squares are calculated in the usual way. Note that ABCD 
does not appear, since it is a component of the sum of squares between 
sub-blocks. 

When the sub-unit treatments form a 23 system, a similar technique 
may be used for any number of levels of the factor A. Single replications 
of the layouts are shown in table 7.19 for A at 2, 3, 4, 5, and 6 levels, 


TABLE 7.19 SPLIT-PLOT DESIGN FOR AN а X 2? FACTORIAL SYSTEM 


Arrangement of whole-units in a single replication 


a = 2:2 x 23) а = 3;(3 X 23) 
I, 1 Та 1 
Is). Gl) Е 
а = 4;(4 X 23) 
I, 1 
Г ems ) , 
ao | ay’ | аз | аз a’ | ay | а | аз 
uj J 
a = 5;(5 X 23) 
1. I 
І | ТІ "РТ o Беті 
ao | а | а | ag | а, а | ay’ | аг | аз |: 
а = 6;(6 X 23) 
1, 1 
EE 
| в | ay’ | аз | as | a4 | а’ | а | а | a | аз | a4 | ау 


Each unit contains 4 sub-units, A prime (^) implies that the sub-unit treatments 
are (1), bc, bd, and cd, and an unaccented a that they are b, c, d, and bed. 


The arrangements differ slightly according as the number of levels of 
А is even or 044. With an even level, one component of ABCD is con- 
founded between sub-blocks. For example, with A at 4 levels, this 
component is (аз + a, — аз — 49) ВСР. When А has an odd number 
of levels, we are forced to confound BCD at least partially with sub- 
blocks, and for simplicity it has been completely confounded. 

The randomization for a single replicate is as follows. (i) Allot the 
Sub-blocks a and b at random to the 2 halves into which the replicate has 
been divided. (ii) Allot the levels of A at random to units within each 
Sub-block. (iii) Allot the appropriate set of 4 sub-treatments at random 
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within each unit. A separate randomization is used in each replication. 
The partition of degrees of freedom in the analysis of variance is shown 
in table 7.20 for r replications of the basic design. 


TABLE 7.20 PARTITION OF DEGREES OF FREEDOM FOR AN а X 2° SPLIT-PLOT 


DESIGN 
Units Sub-units 

а even d.f. а odd d.f. d.f. 
Sub-blocks (2r — 1) (2r — 1) Sub-treatments 6 
A (a — 1) (а — 1) A Xsub-treatments 6(a — 1) 
BCD 1 Error (0) 6a(r — 1) 
ABCD (а — 2) @ =) Total бот 
Error (а) (а - 1)(r — 1) 2(«—1)r— 1) 

То{а1 (2ar — 1) (2от — 1) 


When a is odd, the sum of squares for BCD is omitted from the treat- 
ments; otherwise all sums of squares are obtained in the usual way. 
With o even, 1 d.f. is missing from ABCD, since it is confounded with 
sub-blocks. Since these designs are unlikely to be used unless ABCD is 
small, it will usually be satisfactory to combine the ABCD sum of 
squares with error (a). The pooled sum of squares can be found by sub- 
traction without calculating that for ABCD. 

A considerable number of arrangements of this type are possible, 
though some care is required in their construction in order to avoid 
confounding important comparisons. Finney (7.8) gives an account of 
the principles of construction and designs with worked examples, for a 
4 X 2? and a 6 X 2? factorial, where the units are in a 4 X 4 and 6 X 6 
latin square, respectively. Yates (7.7) gives a similar design for 6 X gs 
factorial. 
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СНАРТЕВ 8 


Factorial Experiments Confounded 
in Quasi-latin Squares 


8.1 Introduction 


Yates (8.1) has constructed a number of confounded arrangements for 
factorial designs in latin squares. In the first group of designs (plans 
8.1-8.7), some of the interactions among the factors are confounded with 
the rows and columns of the squares. These designs resemble ordinary 
latin squares in that the differences among the rows and columns of the 
squares are eliminated from the experimental errors of the unconfounded 
treatment effects; however, the designs do not possess the typical latin 
square property that each treatment appears once in every row and every 
column. Yates (8.1) has proposed the name quasi-latin squares. 

Before one of these plans is used, it is advisable to note the interactions 
that are confounded and the extent to which they are confounded, par- 
ticularly since more treatment effects must be confounded with latin 
squares than with randomized blocks. Of the eight plans shown, only 
one involves the confounding of two-factor interactions. 

In the statistical analysis, the sums of squares for rows, columns, and 
all unconfounded treatment effects are calculated in the usual way. 
Partially confounded effects, if considered negligible, may be combined 
with the error, which is found by subtraction; or they may be tested for 
significance by calculating their sums of squares from those portions of 
the experiment in which the effects are unconfounded. 

In a further group of designs (plans 8.8-8.14), certain treatments are 
applied to complete rows or columns of a latin square. These designs 
have been called half-plaid squares or plaid squares, from the resemblance 
of the plan to a Scotch plaid. 


8.2 Randomization of Quasi-latin Squares 


The rows and the columns of each latin square are rearranged at ran- 
dom. In certain plans (e.g., 8.2) each square contains several replica- 
tions. The randomization must not be restricted in such cases with the 
object of keeping the replications separate. 
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8.3 Notes on the Plans and Statistical Analysis 


28 factorial in a 4 X 4 square. There are a number of alternative de- 
signs, of which two are given in plan 8.1а and 8.16. In the former, which 
requires 4 replications, }4 relative information is retained on AB, АС, 
BC, and ABC, main effects being unconfounded. On account of the high 
degree of confounding of the two-factor interactions, this plan is not 
recommended unless the factors are presumed to act independently or 
unless the 4 X 4 square is considered much more accurate than alter- 
native designs. 

A numerical example illustrates the general method of analysis for all 
designs presented here. The data in table 8.1 represent twice the yields 


TABLE 8.1 YIELDS AND STATISTICAL ANALYSIS OF A 2° FACTORIAL IN TWO 
4 X 4 QUASI-LATIN SQUARES 


Square I Square II 
ABC АС АВ BC 
с (1) abe а Totals с а ab b Totals 
57.9 62.3 60.2 58.6 239.0 62.4 64.6 68.5 63.1 258.6 
АВ АВС 
а ас b bc ( ac bc a 
66.2 60.5 60.0 53.4 240.1 61.5 62.7 58.6 69.3 252.1 
ас № (1) а ас b (1) ac 
641 613 63.2 59.2 247.8 66.0 56.5 63.5 67.3 253.3 
BC AC 
b ab ас с ab Ы а с 
58.4 61.8 67.0 61.9 2491 65.0 52.66 581 58.1 234.7 
Totals 246.6 245.9 250.4 233.1 976.0 255.8 236.4 248.7 257.8 998.7 
d.f. 8.8. m.s. 
Squares 1 16.10 
Rows 6 100.90 16.82 
Columns 6 112.71 18.78 
Ag Boc 3 146.43 
AB I 18.92 18.92 
AC 1^ 0.06 0.06 
BC r 0.33 0.33 
ABC uU 8.27 8.27 
Error ll 101.86 9.26 
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in bushels per acre from a soil-treatment test on soybeans, st Muscatine, 
Iowa. The plots were split for a varietal comparison which has been 
omitted. For convenience, rows and columns were rearranged so that 
the field plan corresponds to plan 8.1а. The factors were limestone (A), 
phosphorus (B), and potash (С); in each case the lower level implies no 
application of the treatment. 

To obtain the sums of squares for the treatment effects, first calculate 
the factorial effect totals in the usual way. These are shown in the first 
line of table 8.2. For the partially confounded effects, these totals must 


TABLE 8.2 TREATMENT EFFECTS FOR A 2° FACTORIAL IN 4 X 4 
QUASI-LATIN SQUARES 


[А] [В| сі [AB] [AC] [ВС] [ABC] 


Totals +658 —181 —97 +357 +349 — 81 — 4.3 
From rows and columns +183 +359 —104 + 7.2 
Adjusted totals 4174 —10 423 —11.5 


be adjusted. Now AB is completely confounded with the first 2 rows of 
square I and the first 2 columns of square II. Calculate AB as obtained 
from these rows and columns, i.e., 


239.0 — 240.1 + 255.8 — 236.4 = +18.3 


This figure is placed in the second row of table 8.2, with corresponding 
figures for AC, BC, and ABC. The adjusted totals are found by sub- 
tracting the second from the first row. Of course, these represent just 
the totals for each effect as found from that part of the experiment in 
which the effect is unconfounded. The divisors for the squares are 32 
for main effects and 16 for the other effects. 

The two-way tables of means require adjustment. For the AB table 
the adjustment is 


шш ae 


— —0.03 
16 32 % 


This value is added (algebraically) to the (1) and (ab) means and sub- 
tracted from the (a) and (0) means. 

Plan 8.1b completely confounds ABC with squares; all other effects 
are unconfounded. The arrangement has the peculiarity that half the 
treatments appear only in square I, while the other half appear only in 
square II. Although this layout avoids the confounding of any two- 
factor interactions, Yates has pointed out that any interaction of the A 
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effect with squares will appear in the analysis as a BC interaction, The 
results are therefore likely to be misleading if the experimental material 
in square I differs in responsiveness from that in square II. Where this 
danger seems likely, Yates suggests that the columns (or rows) of the 
two squares may be interlaced, so that each square comprizes a ran- 
domly chosen half of the experimental material. A numerical example 
of the statistical analysis is given by Yates (8.1, p. 33), using uniformity 
data. 

24 factorial in an 8 X 8 square. Four replications are required for plan 
8.2. ABCD is completely confounded, and threefold replication is ob- 
tained on all three-factor interactions. The degrees of freedom sub- 
divide as follows, 


d.f. 
Rows 7 
Columns { 
Treatments 14 
Error 35 


In the computation for ABC, ABD, ACD, or BCD, the appropriate pair 
of rows is deleted. 

25 factorial in an 8 X 8 square. The confounding in plan 8.3 (for 2 
replications) is not completely balanced, Eight of the 10 three-factor 
interactions and four of the 5 four-factor interactions receive only single 
replication, The remaining high-order interactions, ABE, CDE, ABCD, 
and ABCDE, are unconfounded. By a proper assignment of the letters 
а, b, +++ to the factors, any single three-factor interaction may be left 
unconfounded, though not any desired pair of interactions. 

The partition of degrees of freedom is: 


d.f. 
Rows 7 
Columns 7 
Treatments 31 
Error 18 


In the computation of the sum of squares for ABC, ADE, or BCDE, the 
4 rows in the experiment which correspond to the first 4 rews of plan 8.8 
are omitted. The 4 rows or columns to be omitted for any other partially 
confounded comparison are found similarly from plan 8.3. If the high- 
order interaction effects are likely to be negligible, all partially con- 
founded effects and ABCD and ABCDE may be combined with the 
error. This procedure gives 17 d.f. for treatments and 32 for error. 
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2 factorial in an 8 Х 8 square. In each square of plan 8.4, 8 three- 
factor and 6 four-factor interactions are completely confounded. If the 
experiment has a single replication and if only the main effects and two- 
factor interactions are isolated for tests of significance, the degrees of 
freedom are allotted as follows. 


df. 
Rows 7 
Columns T 
Main effects 6 
Two-factor interactions 15 


Error (estimated from high-order interactions) 28 


Any unconfounded three-factor interaction may also be tested. 

When there are several replications in the experiment, the squares 
should be chosen so that the most important three-factor interactions 
are free from confounding. The balanced design, which requires 5 repli- 
cations, gives 3/5 relative information on all three- and four-factor inter- 
actions. 

3? factorial in a 9 X 9 square. Three or six replicates may be used 
(plan 8.5). Four of the 8 d.f. for ABC are completely confounded in each 
square. The 2 squares together give balanced confounding of АВС, with 
1/2 relative information. 

When square I alone is used, the components of the analysis of vari- 
ance are: 


d.f. 
Rows 8 
Columns 8 
A,B,C 6 
АВ, АС, BC 12 
АВС 4 
Error 42 


The 4 d.f. for АВС are evaluated as follows. Compute the total yields 
of the 3 groups of 9 treatments which are deseribed as set П (a), (с), and 
(b) in square 2. The sum of squares of deviations of these З totals, with 
divisor 27, constitutes 2 of the 4 d.f. The remaining 2 are found similarly 
from set IV (a), (b), and (c) in square 2. 

With both squares, all 8 d.f. for АВС may be computed, sets I and III 
being taken from the experimental data for square II, and sets II and IV 
from the data for square I. 

8* factorial in a 9 X 9 square. Each square in plan 8.6 provides 1/2 
relative information on ABC, ABD, ACD, and BCD. For an experiment 
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with a single replication, the simplest subdivision of the degrees of free- 
dom is: 


d.f. 
Rows 8 
Columns 8 
Main effects 8 
Two-factor interactions 24 


Error (estimated from high-order interactions) 32 


When 2 replications are available, both squares should be used. АП 
8 d.f. for any three-factor interaction may be tested, though each is based 
on only a single replication. The calculation of the sums of squares for 
the three-factor interactions follows the method outlined for the 3* fac- 
torial in blocks of 9 units (section 6.32). Plan 8.6 indicates the replica- 
tion from which any component must be taken. Thus, if ABC is being 
calculated, we use the data from square II for ABC I and ABC IV and 
the data from square I for ABC II and ABC III. 

4 X 2 factorial in an 8 X 8 square. Plan 8.7 requires 4 replications. 
The symbols A’, А”, A’” refer to components of the A effects. 


А’ = аз + аа – ә – ај A” = аз — aj — ә + а; 
Al” = аз — а + а — а 


No two-factor interaction is confounded. 

In view of the small relative information on ABC (only 1/3 on the 
average), the sum of squares for this term will usually be combined with 
the error. In this event the analysis of variance reads as follows. 


d.f. 
Rows 7 
Columns 7 
Main effects and two-factor interactions 12 
Error (by subtraction) 37 


8.4 Other Quasi-latin Squares 


Yates (8.1) gives experimental plans which can be used for the follow- 
ing factorial combinations: 32 X 2 (6 X 6 square), 4 X 23 (8 X8 
square), and 9? (9 X 9 square). Analogous to the last design are designs 
for 7? (7 X 7 square) and 5? (5 X 5 square). In a general discussion of 
methods for constructing such plans, Rao (8.3) presents designs for a 2* 
factorial in 4 Х 4 squares. 

For certain factorial combinations of treatments, the lattice squares 
(chapter 12) and the incomplete latin squares (chapter 13) confound ай 
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treatment effects to the same extent. These arrangements may be use- 
ful in cases where the experimenter does not wish to sacrifice information 
on the interactions. 


8.5 Estimation of the Efficiency of Quasi-latin Squares 


In many cases the precision of a quasi-latin square can be compared 
with that of some other design which might have been used. The data 
from the example in table 8.1 will illustrate two types of comparison. 

In this experiment a 2° factorial system applied to plots of soybeans 
was arranged in two 4 X 4 squares, with 1/2 relative information on 
AB, AC, BC, and ABC. As an alternative, the experiment might have 
been laid out in randomized incomplete blocks of 4 plots each, with 
complete confounding of ABC (plan 6.1). Since the columns were more 
compact in shape than the rows, we will suppose that the columns would 
have been chosen as blocks. 

We first estimate the error m.s. that would have been obtained with 
incomplete blocks. It is helpful to construct analyses of variance for the 
2 designs (table 8.3) for the case where there are no treatments. The 


TABLE 8.3 ANALYSES OF VARIANCE (APPLICABLE TO UNIFORMITY DATA) 


Quasi-latin square Incomplete blocks 
d.f. m.s. d.f. m.s. 
Columns 6 18.78 Blocks (columns) 6 18.78 
Rows 6 16.82 Intra-blocks 24 (11.15) 
Intra-row-and-column 18 9.26 


mean squares to be inserted in the table are explained in the next para- 
graph. 

With incomplete blocks, the degrees of freedom to be divided between 
treatments and error are the 24 d.f. within columns (3 from each of the 
8 columns). Since the randomization ascribes to treatments а random 
selection from these 24 d.f., the mean square for the 24 d.f. provides an 
unbiased estimate of the error appropriate to the incomplete block de- 
Sign. The 24 d.f. subdivide into 6 representing variations among rows, 
and 18 representing intra-row-and-column variation. If the interactions 
partially confounded with the rows are regarded as negligible, the rows 
m.s. (16.82) from the quasi-latin square analysis provides an estimate of 
the 6 d.f. The quasi-latin square analysis does not supply information 
on the whole of the remaining 18 d.f., since in the experiment 7 of these 
were ascribed to treatments. By virtue of the randomization for the 
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quasi-latin square, however, the error m.s. (9.26), with 11 d.f., may be 
taken as an estimate of the mean square for the whole 18 d.f. 
Accordingly, an unbiased estimate of the error m.s. for the incomplete 
block design is 
18 X 9.26 + 6 X 16.82 


24 


= 11.15 


For the estimation of main effects, the efficiency of the quasi-latin 
square relative to incomplete blocks is estimated as 11.15/9.26, or 120%. 
Owing to the confounding in the quasi-latin square, the relative efficiency 
on the interactions is only 60%. 

The quasi-latin square may also be compared with ordinary ran- 
domized blocks of 8 plots, where cach replication is a pair of columns. 
In table 8.1 the column totals are as follows. 


Rep. I Rep. II Rep. Ш Rep. IV 
246.6 245.9 250.4 233.1 255.8 236.4 248.7 257.8 


With uniformity data, the randomized blocks error contains 28 d.f. 
(7 from each replicate). То obtain this error, the incomplete blocks 
error (24 d.f.) must be pooled with the 4 d.f. which measure differences 
between the 2 columns in each pair. Since the differences between the 
2 column totals in each replication are 0.7, 17.3, 19.4, and 9.1, respec- 
tively, the sum of squares for the 4 d.f. is 


(0.7)? + (17.3)? + (19.4)? + (9.1)? 
8 


= 94.87 


This gives [24(11.15) + 94.87]/28 = 12.95 as an estimate of the random- 
ized blocks error. The relative efficiency of the quasi-latin square is 
140% on main effects and 70% on interactions. 

The comparisons above require the assumption that treatment effects 
which are confounded with rows or columns are non-existent. By more 
complicated procedures, estimates can be made which avoid those аз- 
sumptions. Тһе relative efficiencies quoted above are slightly too high, 
because we neglected the effect of the additional error d.f. that would 
be available in the alternative designs (section 2.31). 


8.6 Treatments Applied to Complete Rows of a Latin Square 


A simple example (taken from plan 8.8) is shown below. The latin 
square treatments form а 2 X 2 factorial for factors В and С. Ап extra 
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factor А has been superimposed оп the rows. The first 2 rows receive 
the lower level of A, and the last 2 rows the higher level. 


ABC 


Q |b c (D be 
Qje 5 b () 
A 


a |) b b с 
а |с (c b 


This plan might be tried in a case where the extra factor was not suit- 
able for application to individual units. No precise estimate of the ef- 
fect of А would be expected, but some useful information on the inter- 
actions between А and the other factors might be hoped for. However, 
this device has one property that necessitates caution in its use. When- 
ever an extra factor is superimposed on the rows, certain interactions 
between that factor and the latin square treatments are automatically 
confounded with columns, as noted by Yates (8.2). In the plan above, 
the ABC interaction 


(abc) + (a) + (b) + (c) — (ab) — (ac) — (bc) — (1) 


is the difference between the first 2 columns and the last 2 columns. On 
the other hand, AB and AC are free of column effects. In the plans pre- 
sented here, which are due to Yates (8.1), such confounding is confined 
аз far as possible to high-order interactions. The reader who constructs 
his own designs should verify, before using them, what interactions have 
been confounded in this way. 

As indicated above, these designs are useful in experiments where (i) 
it is not easy to apply factor A to individual units, and (ii) accurate 
estimates of the interactions of 4, but not of its main effects, are wanted. 
For instance, in an experiment on an irrigated crop, we might wish to 
find out how the responses to plant nutrients B and C are affected by the 
level of the water supply. Factor А could then represent à restricted 
and an abundant amount of irrigation water, applied to whole rows of the 
square. 

These half-plaid squares are related both to quasi-latin squares and to 
split-plot designs. The difference from quasi-latin squares is that in the 
latter the confounding is confined to interactions, whereas main effects 
also are confounded in half-plaid squares. The connection with split- 
plot designs is seen when the rows are regarded as the experimental 
units, and the units (or plots) as sub-units. From this point of view the 
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plan above may be considered as a split-plot design with the sub-units 
arranged in latin squares. 

A moderately large number of useful half-plaid squares can be con- 
structed; only five of the simplest are given here. In the notation em- 
ployed, the factorial system for the latin square treatments is placed in 
parentheses ( ) while the number of treatments imposed on the rows 
is shown before the parentheses. Thus plan 8.8 shows a 2 X (2?) fac- 
torial. 

As with an ordinary latin square, rows and columns must both be per- 
muted at random. The whole-row treatments cannot be arranged in 
separate blocks. 

The statistical analysis may be followed from the partition of the de- 
grees of freedom, which is given in condensed form with each plan. As 
in the split-plot design, there are two errors. The error for the factor A 
applied to the rows is derived from the sum of squares among rows minus 
the sum of squares for A. 

Most of the other treatment effects are unconfounded. Their sums of 
Squares are caleulated in the usual way and tested against the latin 
square error. In plans 8.9 and 8.10 certain interactions are partially con- 
founded with columns; the caleulations for these effects are described 
in the notes below. In the remainder of the plans, some interactions are 
completely confounded with rows or columns and consequently do not 
appear in the treatments s.s. The error s.s. is found by subtraction. 


8.61 Notes on the Plans and Statistical Analysis. 2 х (2?) factorial 
н а4 б 4 square (plan 8.8). At least two squares (4 replications) should 
е used. 

2 X (8 X 2) factorial in a 6 X 6 square (plan 8.9). Each square con- 
tains 3 replicates. BC and ABC are partially confounded with columns, 
the relative information being 8/9 and 5/9, respectively. The sums of 
squares for these effects are calculated by the procedure for the 3 X 2? 
factorial in incomplete blocks of 6 units, where the columns constitute 
the blocks. A numerical example is given by Yates (8.1, p. 58). Col- 
umns Ia --- ІШ are identical, respectively, with Yates’ blocks Ia -+-+ 
IIIb. Notice that the factor applied to the rows is C. 

3 X (8 X 2) factorial in a 6 X 6 square (plan 8.10). Each square re- 
quires 2 replicates, while the 2 squares comprize a balanced set. Col- 
umn effects are partially confounded with AB and ABC, the average 
relative information being 7/8 and 5/8, respectively. If the columns are 
regarded as incomplete blocks, the numerical example in section 6.19 
may be followed for the computation of the sums of squares for AB and 
ABC. 
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The statistical analyses for the 2 X (2%) factorial (plan 8.11) and the 
2х (2*) factorial (plan 8.12) іп 8 X 8 squares are straightforward. In 
the latter, the BCDE and ABCDE interactions have been combined 
with the error for A. 

Other designs. Yates (8.1, р. 79) presents the plans for a 4 X (2?) fac- 
torial in an 8 X 8 square and (8.1, p. 80) for à 3 X (33) factorial in а 
9 9 square. Plans can also be constructed for a 2 X (25) system in a 
8 X 8 square and for a 3 X (3?) system in a 9 X 9 square. 


8.7 Treatments Applied to Complete Rows and Columns of a Latin 
Square 


In these designs one set of treatments А is applied to complete rows 
of the latin square and another set B to complete columns. Although 
relatively low precision is obtained on the row and column treatments, 
the latin square treatments and their more important interactions with 
the А and B treatments are subject only to the latin square error. The 
number of useful designs appears to be limited. 

Two examples are shown in plans 8.13 and 8.14. The latin square 
treatments form a 23 and 2* system, respectively; the additional treat- 
ments are both at 2 levels. No two-factor interactions are confounded. 
The rows and columns of each square should be completely randomized. 

The subdivision of degrees of freedom is indicated on each plan. In- 
teractions which are confounded with rows and columns must be omitted 
from the treatments 8.8.: all other treatment effects are calculated in the 
usual way. In plan 8.14, where the square contains a single replication, 
the only available estimate of error comes from the high-order inter- 
actions, unless two or more squares are used. 

A 9 X 9 square for а 3? system of treatments, with row and column 
treatments also at 3 levels each, is given in reference (8.1), p. 81. 
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PLANS 
Plan 8.1a 2? factorial in two 4 X 4 quasi-latin squares 
Square I Square II 
ABC AC AB BC 
с (1) | abe ab с а |ab b а 
а ac|b te zum (1) аа| ab inde 
abe bc | (1) a abe b |(1 ac 
b а |а c 86 ab bela c oe 
Plan 8.16 2 factorial in two 4 X 4 latin squares 
ABC completely confounded with squares 
Square I Square П 
(1) be ae ab 8 а асс 
ac ab (1) bc асс a b 
bc (1) ab ac 8.55 c abc 
ab ac be (1) c abc b a 
Plan 8.2 24 factorial in an 8 X 8 quasi-latin Square 
ABCD 
c abed b ad а bd ас cd, Y 
abd () bod ic. ad a а ah ABC 
d к а abd b а ad ac 
bed ad acd bd abe ab с (1) сар 
а 4 с ab а abd acd bc 
abe ac abd cd 54 () b ш ACD 
b ab: qid: 7 Ges де ad bed аһа 
acd cd abe (1) abd b а ы BOD 


PLANS 


Plan 8.8 25 factorial іп an 8 X 8 quasi-latin square 
ACE, BCD, ABDE ACD, BDE, ABCE 


() abe be ace |abd acd bede de 
be ас е ab bed а abde acde 
c 


cde abd Ме ad abce ae b ABC, ADE, BCDE 


abc се acde bed (1) Ме ad abe 

acd ede abce c ade ab е bd 

беа а ы [aie № Miaa Mari Par: BO AGDE 

ae b abd de ac abede cd bce 

Plan 8.4 2° factorial in 8 X 8 quasi-latin squares 
Square I 
Columns confounded with Rows con- 
CDE, BDF, ABE, ACF, BCEF, ABCD, ADEF founded with 
abcdef ce bf bde ае abe ай cd 
574,2 adef bef of M aki 4В6 
bdf af abcef ака с be cdef айе ADF. 
acf bedf def (1) abd асе abef bce СЕР 
be acd аме abf df bedef е acef ACDE 
еј abdef acdf асе bede d bef аһ BCDF 
ad b ce оў abf aef abcde. еј АВЕР 
abe de bed bef f abdf ac j 
Square II 
АВС, CDE, ADF, BEF, ABDE, BCDF, ACEF 

abcdef bf «ДІ bee abd ac, ау de 
ae 5 def, ај df abode beef age 
cde a ака асеў d bef фе ABD 
aif с ef (0) ай abe dif bed ДЕР 
abe e се 5а abf асау аа f BDEF 
bd abce ade а}, of { с асај ACDE 
bef abcdf af abde bc cdef асе АВСР 


of adef abcef acd е bede bdf а 


Square III 
ABF, CDF, ADE, BCE, ABCD, BDEF, ACEF 


abcdef be def E abd 


af 
be, 
d 
n ка сј bdef аке асе} ad е ABEF 
ce 
d 
bedf 
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Plan 8.4 (Continued) 2° factorial in 8 X 8 quasi-latin squares 
Square IV 

BCD, ACE, ABF, DEF, ABDE, ACDF, BCEF 
aby ad ce des ef ^ df abedef 
de bee ама а асае] abef м 4 
с м abe acde af abcdf beef of 
be; cde; e Е 9 bed 

абс] bedef е e асе 


acdf aby 
ме x өмі d 


Square V 


ахау bdf be acd de cf abe af 
аса) 


се а if bef abef abcde def Ый 
bed abde  abcef adf ас bf e 
А abf abd (1) ace adef bede cdf 
ү; bedef be асеў abc 
ab ке — bdef aef с d abedf acde 
ac ef cde аӛсе b bedf ad abde; 
acdef ae bde abcd ab; с beef 
Plan 8.5 33 factorial іп 9 Х 9 quasi-latin squares 


Square I 
ABC III confounded 


Set ПТ (а) 000 102 201 011 110 212 022 121 


220 


(с) 101 200 002 112 211 010 120 222 


021 


(5) 202 001 100 210 012 111 221 020 
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201 


002 


100 
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(9 


АВСП 


Plan 8.5 (Continued) 
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Set ТУ (а) 000 101 202 012 110 211 021 122 220 


(b 102 200 001 111 212 010 120 221 022 


( 201 002 100 210 011 112 


(а) 012 110 211 021 122 220 


(b 111 212 010 120 221 022 102 200 001 ABCI 


(с) 210 011 112 222 020 121 201 002 100 


(а) 021 122 220 000 101 202 012 110 211 


(b 120 221 022 102 200 001 111 212 010 


(©) 222 020 121 201 002 100 210 011 112 


Set II (à (c) (0) @ © (b @ (9 


Plan 8.6 3* factorial in 9 X 9 quasi-latin squares 
Square I 
Columns confounded with ABC IV, ABD I, ACD II, 
BCDI Rows confounded with 


0000 1011 2022 0121 1102 2110 0212 1220 2201 
1021 2002 0010 1112 2120 0101 1200 2211 0222 
2012 0020 1001 2100 0111 1122 2221 0202 1210 
0122 1100 2111 0210 1221 2202 0001 1012 2020 
1110 2121 0102 1201 2212 0220 1022 2000 0011 
2101 0112 1120 2222 0200 1211 2010 0021 1002 
0211 1222 2200 0002 1010 2021 0120 1101 2112 
1202 2210 0221 1020 2001 0012 1111 2122 0100 
2220 0201 1212 2011 0022 1000 2102 0110 1121 


Square II 
Columns confounded with ABC IIT, ABD II, ACD III, 
BCD Iv 
0000 1022 2011 0112 1101 2120 0221 1210 2202 
1012 2001 0020 1121 2110 0102 1200 2222 0211 
2021 0010 1002 2100 0122 1111 2212 0201 1220 
0111 1100 2122 0220 1212 2201 0002 1021 2010 
1120 2112 0101 1202 2221 0210 1011 2000 0022 
2102 0121 1110 2211 0200 1222 2020 0012 1001 
0222 1211 2200 0001 1020 2012 0110 1102 2121 
1201 2220 0212 1010 2002 0021 1122 2111 0100 
2210 0202 1221 2022 0011 1000 2101 0120 1112 


ABC I 
ABD Ш 
ACD IV 
BCD и 


Rows confounded with 
ABC II 
ABDIV 
ACDI 
BCD Ill 
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4 X 22 factorial in an 8 X 8 quasi-latin square 
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2 X (27) factorial іп a 4 X 4 half-plaid square 


ABC 


Rows 
а) А 


Error 


k squares (2k replicates) 
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2 X (3 X 2) factorial in a 6 X 6 half-plaid square 


BC, ABC partially confounded 
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Plan 8.10 3 X (3 X 2) factorial in a 6 X 6 half-plaid square 
Square I Square IT 
AB, ABC partially confounded AB, ABC partially confounded 
a | be a | bc 
0 20 10 00 21 17 01 0 10 20 00 1 21 0 
0 по 21 10 0 20 0] 21 01 11 2 00 10 
1,19 2 10 074 п A 1 (POO: 310599230 © о” Hat 
1/21 11 0 2 10 00 1 11 21 O1 10 20 00 
2/01 21 11 00 20 10 2190 00 10 21 01 п 
210 00 20 пп 0” 2/01 11 21 00 10 2 
Ia Ib Ie Па Ш Пс Ша IIb Шс IVa IVb IVe 
Square I alone (2 replicates) Both squares (4 replicates) 
Rows 5 Squares 1 
A 2 Rows 10 
Error 3 A 2 
Columns 5 Error 8 
B, C, ВО, AQ 7 Columns 10 
AB 4 В, С, ВС, АС 7 
АВС 4 АВ 4 
Error 10 ABC 4 
— Error 35 
Total 35 +] 
То{а1 71 
Plan 8.11 2 X (23) factorial in an 8 X 8 half-plaid square 


ABCD confounded 


-|(1) be bd cd b c а bed Rows 7 
-| (1) d bd а ted bic A 1 
- bd cd (1) bc bd d c b Ег 6 
= |ed bd bc (1) c b bed d Columns T 
E B,C,D 3 
ajb с d bed (1) bd cd be AB,AC,AD,BC,BD,CD 6 
ajc b bed d be «а bd (1) ABC, ABD, ACD, BCD 4 
ajd bed b с а (1) be bd Error 36 
ааа c b bd be (1) ed "c 
"Total 63 
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2 X (24) factorial in an 8 X 8 half-plaid square 


ABD, BCE, ACDE confounded 


(1) bd be cd ce bede be de Rows 7 
bed с d bee bde e cde b A 1 
се bedebe de (1) bd be cd Error 6 
bde e cde b bed с а bee Columns 7 
Main effects 4 
be cd ce Бойе be de (1) bd Two-factor interactions 10 
d b bed c cde bee bde e Three-factor * 8 
be de (1) bd be cd ce bede Four-factor e 3 
cde bre bde e d b bodce Error 24 
Total 63 


2 x 2 X (23) factorial in an 8 X 8 plaid square 


B, ACDE, ABCDE confounded 


= - - bb bb 
ce c de d 
cd e (1) cde 
de d се c 
(1) cde cd e 
с ‘а de 
е (1) cde cd 
d de c с 
cde cd e (1) 


odef f ed |f 


Rows 7 
A 1 
Error 6 

Columns 7 
B 1 
Error 6 

Main effects 3 

Two-factor interactions 10 

Three-factor ы 9 
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Error 24 

Total 63 
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GHAPTER “SA 


Some Methods for the Study 
of Response Surfaces 


8A.1 First Order Designs 


8A.11 Introduction. In chapter 6 it was pointed out that the most 
informative method of analysis of the results of a factorial experiment 
depends on the nature of the factors. If all the factors represent quan- 
titative variables like time, temperature, amount of nitrogen, it is nat- 
ural to think of the yield or response y as a function of the levels of 
these variables. We may write 


Yu = 9(Тіу) ou, 777) Tku) + eu 


where и = 1, 2, ... N represents the М observations in the factorial 
experiment and z;, represents the level of the ith factor in the uth obser- 
vation. The function ¢ is called the response surface. The residual e, 
measures the experimental error of the uth observation. A knowledge 
of the function ¢ gives a complete summary of the results of the experi- 
ment and also enables us to predict the response for values of the хм 
that were not tested in the experiment. 

When the mathematical form of ¢ is not known, this function can 
sometimes be approximated satisfactorily, within the experimental re- 
gion, by a polynomial in the variables хм. In this chapter we describe 
some experimental designs and methods of analysis that have been de- 
veloped for fitting polynomials of the first and second degree. Box 
(8A.1) has called these designs first order designs and second order de- 
signs, respectively. The second part of this chapter deals with an im- 
portant problem in which these designs can be used—namely, that of 
finding the level at which each of the z-variables should be set in order 
to maximize $. 

First, a word of caution. Polynomial response surfaces have the great 
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advantage that they are easy to fit. With a suitable choice of design, 
even a quadratic surface in 6 variables, which contains 28 coefficients 
to be estimated, is not too formidable a task. On the other hand, 
polynomials are notoriously untrustworthy when extrapolated. A poly- 
nomial surface should be regarded only as an approximation to ¢ within 
the region covered by the experiment. Any prediction made from the 
polynomial about the response outside the region should be verified by 
experiments before putting reliance on it. 


8A.12 Calculations for a Multiple Linear Regression. Since the fit- 
ting of a polynomial can be treated as a particular case of multiple linear 
regression, we shall review the calculations required to fit a multiple 
linear regression of y, on the k variables хм (+ = 1, 2,---k; и = 1, 
2,... №). The relation between y, and the тұ, is of the form 


Yu = Bo + Віла + Botou +" А Batu + eu 
Since Во occurs in every equation, it is customary, for the sake of uni- 
formity, to introduce a dummy variable zo, which has the value 4-1 for 
every observation in the sample. The equation can then be written 


Yu = Botou + Biziu + Batou +++*+ Bktku + eu 
Step 1. List the sample values of the z;, and the y, as in table 8A.1. 
The two-way array of the z;, values is called the X matrix, and the col- 
umn of у, values is called the Y vector. 
Step 2. The least squares estimates b; of the В; are chosen so as to 
minimize the sum of squares of deviations 


N 
У (ун — болош — bitin —* ++ — бы)? 


u-l 


The values of the b; which minimize this expression satisfy the normal 
equations 
bo(00) + b (01) +-++-+ (05) = (0y) 


bo(10) + bi (11) ++++-+ 0010) = (1y) 


5о(/0) + bi (kl) +--+ 5(Е) = (ky) 


where 3s 
(0) = (ji) = È хил = Sum of products of ith and jth columns in X. 
N u-l 
(ii) = У) ты? = Sum of squares of ith column in X. 
ual 


N 
(iy) = У) шуш = Sum of products of ith column in X with Y. 


и=1 
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It is sufficient to list the sums of squares and products (17) and (zy), as 
shown in table 8A.1, without explicitly writing out the equations. The 
square array (17) is called the Х'Х matrix, and the column (iy) is called 
the X'Y vector. 

Мер 8. Although the normal equations can be solved directly, it is 
useful to find a set of intermediate quantities с;;, known as the inverse 
of the matrix (ij). The equations satisfied by cio, си, “7: ci һауе the 
same left-hand side as those for the b;, but the right side consists of zeros 
except in the 7th equation, where the right side is anity. Various methods 
for computing the inverse matrix are available (8A.2, 8A.3, 8A.4). 

Step 4. The regression coefficients b; are obtained as follows. 


k 
b; = У) cj(jy) = Sum of products of ith column of сұ) with the 
j=0 column (jy). 


The formulae for obtaining the residual mean square and the standard 
errors of the b; are given at the foot of table 8A.1. 


TABLE 8А.1 CALCULATIONS FOR A MULTIPLE LINEAR REGRESSION 


Data 
xX Y “ 

Zor zu КУ; та yı 

тоз тп жүт) тк y2 

ton IN an ТЕМ YN 
Normal equations 

üj) =X'X (iy) = X'Y 

(00 (01) ... (0k) (Oy) 

(10) (11) (10 (ly) 

(kO) (а) ==- (kh) (ky) 
Inverse matrix 

сут (ХІХ 

со си $t 0 

сю си e Ck 

сю сы tU б 


Solutions E 


bi = È сі) 
j=0 
k 

Residual sum of squares = В» = (yy) — У) bitty) 
іш 


Residual mean square = s* = Ry/(N — k — 1) 


У) = сз? Соқ) = ся" 
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The normal equations are particularly easy to solve when the variables 
Ту are mutually orthogonal. In this case all sums of products (ij) 
vanish (¢  j), and the normal equation for b; reduces to 


(4) = (iy) 
so that 
2 0) 
* (gu) 
The values of the inverse matrix become ci = 1/(й); с; = 0, i £j. 
Since cj; = 0, the b; are uncorrelated. 

In constructing experimental designs from which regression equations 
are to be computed, it is desirable to obtain normal equations that pre- 
sent little difficulty in solution. Fortunately, the designs that are con- 
structed so as to give the most precise estimates in a polynomial regres- 
sion also turn out to be fairly simple to handle computationally. 


84.13 Numerical Example for a 23 Factorial. Аз an illustration of 
the general approach, a linear equation will be fitted to the results of а 
2° factorial. The data are artificial, and deal with three factors that 
influence the yield of a chemical reaction. 


А = a = Time of reaction (5.5 and 6.5 hr.) 
В = x2 = Temperature (260° and 340°) 
С = = Amount of ап ingredient С (1.4 and 1.8 gm.) 
The equation to be fitted is 
Du = bo + bitin + boron + Вади 


where 2, is the predicted value of у, (u = 1, 2, --- 8). й 
Since each z-variable takes only two different values, the calculations 
are simplified by coding each z-scale so that the upper level of z is +1 


and the lower level is —1, The relations between the coded and the 
original scales are as follows: 


к tof C — 16 
(Шише —6); are cR. ONU. UL Amount of C — 16 
40 02 


The coded values of the z's and the yields appear in table 84.2. The 
computations follow the steps outlined in the preceding section. In the 
X matrix (table 8A.2) the sum of squares of any column is 8 and the 
sum of products of any two columns is 0. The (7j) matrix therefore 
assumes the simple orthogonal form, as does its inverse c;; From the 
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TABLE 8A.2 FITTING А LINEAR EQUATION TO А 2° FACTORIAL 


Data 
x x 
Coded scale 
Treatment zo ті та 23 у 
а) 1-1. -1--1 161 
a 1 1 -1 =l 183 
b 1 —1 1 —1 151 
ab 1 1 1 —1 170 
ё $ —1 —L 1 166 
ac 1 1 -1 1 192 
be 1 =f 1 1 156 
abe 1 1 1 1 183 
Normal equations 
(i) = ХХ (iy) = X'Y 
8 1362 = G 
8 94 =4А 
8 —42 = 4B 
8 32 = 4C 
Inverse matriz 
cij = QCUX) 
и 
и 
% 
и 


Solutions 


1362 94 —42 32 
bo = = 1025; ы 11.75; bp= = 825; = 4.00 


TABLE 8A.3 ANALYSIS OF VARIANCE OF YIELD 


df. 8.8. m.s. 
Linear model 3 1453 484.3 
Lack of fit 4 23 5.8 


solutions (foot of table 8A.2), the fitted equation (rounded slightly) is 
ý = 170.2 + 11.82, — 5.225 + 4.025 


In the analysis of variance, the reduction in sum of squares of y due 
to the regression separates into the usual reduction due to the mean 


(1362)? 


= 231,880 
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and the reduction due to the linear terms 


(94)? + (42)? + (82)? 


= 1,453 
8 


The sum of squares of deviations from the fitted regression, i.e., 
Х- fu)? is found by the short-cut method as 


4 
Уу. – У b;(iy) = 233,356 - 231,880 — 1,453 = 23 
ч i=0 


This sum of squares contains two sources of variation., It receives a 
contribution due to the experimental errors, which make the values y, 
deviate from the true response surface $. It is inflated also by any 
failure of the linear equation to represent the correct shape of the re- 
sponse surface $. If the corresponding mean square is substantially 
larger than the mean square for experimental error, this is a warning 
that the linear equation does not adequately represent the true re- 
sponse surface. Consequently, the mean square of the deviations is 
sometimes described as measuring the “lack of fit.” 

These data, chosen for simplicity of illustration, do not furnish any 
estimate of the experimental error mean square with which the mean 
square for lack of fit can be compared. In practice, provision for an 
estimate of error is essential when first or second order designs are em- 
ployed, since otherwise the experimenter has no idea whether the equa- 
tion is an adequate approximation to the true surface. Some methods 
for obtaining an estimate of error for first order designs are discussed in 
section 8А.15. 


8A.14 Relation to the Standard Factorial Analysis. If we had applied 
the standard method of analysis for a 23 factorial experiment (chapter 5), 
the data in table 8A.2 would have been analyzed into the main effects 
A, B, and C, the two-factor interactions AB, AC, and BC, and the three- 
factor interaction ABC. In table 8A.2 the treatment combinations have 
been written in standard factorial notation on the left of the matrix X. 
It is easily verified that the right sides of the normal equations are, re- 
spectively, the grand total G and the factorial effect totals of А, В, 
and С. Since the main effects are 14 of the corresponding factorial effect 
totals, the coefficients бу, b», and Ъз are ¥ of the main effects of A, B, 
and C, respectively. 

It can be shown, as might be expected, that the 4 d.f. for “lack of fit” 
in table 8A.3 represent the contributions due to the interactions AB, 
AC, BC, and ABC. In fact, if we fit a more complex regression model 
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of the form 
2 = bozo + bızı + bere + bata + я 
+ Ызлихз + boats + 01232110273 
it will be found that 
bam Lam DU ЗЕ ROO EE uS тұр УЫ 
2 2 2 2 


А response surface of this type is unlikely to be considered, however, 
since there seems little reason to include cross-product terms like түлә. 
without including terms in 217 and 227. For squared terms of this type, 
more than two levels of each factor must be introduced. 


8А.15 Some Useful First Order Designs. The 2 factorial designs, 
in single or fractional replication, are convenient in exploratory work for 
fitting a linear relation between the response y and the k factors or x- 
variables. The designs given in table 8A.4 also supply some d.f. for 


TABLE 8A.4 Some USEFUL 2% FACTORIAL DESIGNS FOR FITTING LINEAR 


EQUATIONS 
Size of 

No. of experiment Fraction Хо. of d.f. for 

factors (units) ofarep. “lack of fit” Plan no. 
3 8 1 4 DIE. 
4 8 % 8 6A.1 
5 8 и 2 6А.2 
5 16 м 10 6A.3 
6 8 и 1 6A.4 
6 16 и 9 6A.5 


“Jack of fit" by which the experimenter can judge whether the linear 
equation fits adequately, provided that he has a measure of the experi- 
mental error variance at his disposal. 

Plans for the fractional factorial designs in table 8A.4 are obtained 
from the plans in chapter 6A. The notation is easily translated into 
that of this chapter. For instance, the treatment combination ac in a 
24 factorial corresponds to 21 = 1, 29 = —1 zs = 1:24 1 

As given, these designs do not provide any estimate of the experimen- 
tal error variance. This can be obtained (i) by replication of the whole 
experiment, (ii) in some situations, by the use of an estimate from pre- 
vious experimentation, if there is convincing evidence that the error 
variance remains stable through time, (iii) by adding to the 2% factorial 
а number of tests made at the point at which all z's have the value 0 in 
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the coded scale. Since the levels of every = in the 2" factorial are —1 
and +1, this point may be called the center of the design. 

Suppose that the center point is replicated n; times. As сап be seen 
by writing down the normal equations, these extra points do not change 
the values of any of the regression coefficients b;, except that bo becomes 
the mean of the whole experiment. The extra points serve two pur- 
poses. The sum of squares of deviations of the n, responses at the cen- 
ter from their mean furnishes (n; — 1) d.f. for measuring the experi- 
mental error. Further, if ў is the mean response at the central points 
and 72 is the mean response at the exterior points (i.e., the points in the 
2* factorial), the comparison (ў — ğı) gives an additional d.f. for meas- 
uring the lack of fit. (If the true response surface is quadratic rather 
than linear, this comparison estimates the sum of the coefficients of the 
terms іп 217, 25? --- in the quadratic surface.) The sum of squares for 
this single d.f. is computed as 


2 
> = si (1 — 72) 
where ng is the number of exterior points. 

Since the linear equation in k z-variables contains (k + 1) regression 
coefficients that must be estimated, the smallest experiment to which a 
linear equation can be fitted is one that has (К + 1) observations. 
Plackett and Burman (84.5) and Box (8A.1) have shown how to con- 
struct the most efficient designs of this size. For practical purposes the 
use of these optimum designs is likely to be limited, since they supply 
no d.f. for measuring lack of fit and since most of the designs are less 
convenient to handle than the 2* factorials. 


8A.2 Second Order Designs 


8A.21 The Quadratic Response Surface. The general form of a 
quadratic (second degree) polynomial is illustrated by the equation for 
two z-variables 


= Bo + Віла + Batou + Bitin + Batou? + Bistriut2u + би 


The nme contains linear terms in tiu, Zou, squared terms in т, 
Tou, and the cross-product term 21,24. 

In order to estimate the regression coefficients in this model, each 
variable z;, must take at least three different levels. This suggests the 
use of factorial designs of the 3” series. If the three levels of any x are 
coded as —1, 0, 1, the second order surface is easy to fit to the results 
of а 3* factorial. One disadvantage of the 3* series, however, is that with 
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more than three z-variables the experiments become large, although this 
difficulty can be reduced by the use of fractional replication. Further, 
the coefficients B11, B22 of the squared terms are estimated with rela- 
tively low precision from a 3* factorial, as Box and Wilson (8A.6) have 
pointed out. 

"These authors (8A.6) developed new designs specifically for fitting 
second order response surfaces. Their earliest designs, called composite 
designs, are constructed by adding further treatment combinations to 
those obtained from a 2Ё factorial. If the coded levels of each a-variable 
are —1 and +1 in the 2* factorial, the new designs are as follows. 

Central composite designs. Test the (2k + 1) additional factor com- 
binations 


(00,50; (90, ++ 0)3 (0, 850; 
(0, —а, 50; (09, 59; 
(0,0, 5-723; ($059 


The total number of treatment combinations is (2* + 2k + 1). For 
2, 3, and 4 z-variables, the experiment requires 9, 15, and 25 units re- 
spectively, as compared with 9, 27, and 81 in the 3* series, The value 
of а can be chosen to make the regression coefficients orthogonal to one 
another, or to minimize the bias that is created if the true form of the 
response surface is not quadratie, or to give the design the property of 
being rotatable (section 8A.22). 

The central composite design can be fitted into a sequential program 
of experimentation. T he program starts with an exploratory 2* factorial 
to which a linear response surface is fitted. Suppose, however, that the 
center of the first experiment is actually close to а point of maximum 
response. The results of the first experiment may indicate, by the “lack 
of fit" terms, that the response surface is curved, and may suggest that 
the center of the experiment is close to a maximum. By adding the 
(2k + 1) treatment combinations needed to create a central composite 
design, the experimenter utilizes the results of the initial 2% experiment 
over again when he fits the quadratic surface. This procedure requires 
the assumption that there are no time trends which change the level of 
y between the first and the second experiment. When time trends are 
present, the designs in section 8A.27 are preferable. 

The results of the 2* factorial may suggest that the point of maximum 
response is nearer to one of the other factor combinations than to the 
center. In this event a non-central composite design js likely to give a 
more precise location of the maximum. 


Non-central composite designs. This design has k extra points, one for 
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each factor. The level of any factor is moved to (1 + a) if the optimum 
level seems nearer to 1 than to —1, and to (—1 — a) if the optimum 
level seems nearer to —1 than to 1. For instance, if the optimum is 
thought to be near (1, —1, 1) with three factors, the extra combinations 
are as follows: 


(ПЕР 1 (11 о, 209; (1, -1,1--а) 
Figure 8A.1 illustrates the two types of composite design for three 


factors. 
x 


x 
Central composite design 


* = Points of initial 
23 factorial. 
x = Additional points 


added for composite 
design. 


| 
| 
| 
x 
Non-central composite design 
Ето. $А.1 


8A.22 Rotatable Second Order Designs. In order to appraise differ- 
ent designs that may be proposed for fitting a second order response 
surface, some criteria are needed as to what constitutes a good design. 
One useful property is, of course, that the computations should not be 
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too difficult. In a more intensive consideration of desirable properties, 
Box and Hunter (8A.7) proposed the criterion of rotatability. 

To explain this concept, suppose that the point (0, 0, ---, 0) repre- 
sents the center of the region in which the relation between y and the 
x’s is under investigation. Let 


k k 
Qu = bo + È bitis + У baa? + У булыл 


i=l ial i<j 


be the estimated response at a point on the fitted second order surface. 
From the results of any experiment we can compute s(ĝu), the standard 
error of u, at any point оп the fitted surface. This standard error will 
be a function of the coordinates z;, of the point. In a rotatable design, 
this standard error is the same for all points that are at the same dis- 
tance p from the center of the region, i.e., for all points for which 


т? + той poc ты? = р? = constant 


This property is a reasonable one to adopt for exploratory work, in 
which the experimenter does not know in advance how the response 
surface will orient itself with respect to the z-axes. Consequently he 
Ваз по rational basis for specifying that the standard error of fu should 
be smaller in some directions than in others. 

With two z-variables, Box and Hunter showed that a rotatable design 
is obtained by making tests at 7 points equally spaced around the cir- 
cumference of a circle in the ту, 2 plane with center (0, 0), plus one or 
more tests at the center itself. The points on the circumference lie at 
the vertices of a regular polygon inscribed in the circle. Since there are 
six regression coefficients to be determined when k = 2, the smallest 
design consists of a pentagon, i.e., five points around the circumference, 
plus one point at the center. The general formulae for the actual values 
of zı and 2; may be written as in table 8A.5. The factor о is the radius 
of the circle on which the points lie. 

The replicated points at the center have two purposes. They provide 
(n, — 1) 44. for estimating the experimental error, and they determine 
the precision of at and near the center. If there are many replications 
of the center point, the standard error of ў is low at the center and in- 
creases rapidly as we move away from the center. With only one or 
two center points, on the other hand, the standard error of 2 may be 
greater in the center than at points like (1, 0) or (0, 1). As a com- 
promise, Box and Hunter suggest that the number of center points be 
chosen во that the standard error of ў is approximately the same at the 
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TABLE 8A.5 ROTATABLE SECOND ORDER DESIGNS IN TWO Z-VARIABLES 


ті E] 
а cos 9 «віп0 
а cos 20 а вт 29 
. та points on the circumference 


0 = 2x/na 
с с08 120 asin nð 
0 0 
0 0 
1 p nı points at the center 
0 0 


center as at all points on the circle with radius 1. If this choice is made, 
the standard error remains roughly the same at all points within the 
circle of radius 1. 

With three or more z-variables, rotatable designs are obtained by 
placing the points (21, T2, +++ 2,) at the vertices of certain of the regular 
solid figures, or combinations of two solid figures, plus points at the 
center. The non-central points do not always lie on a sphere. 


8A.23 Central Composite Rotatable Designs. The rotatable designs 
most likely to be useful in practice belong to a series that are also central 


Fia. 8A.2 Central composite rotatable design in two z-variables. 


composite designs. Figure 8A.2 Shows the design in two z-variables, 
obtained from table 8A.5 by putting л» = 8, m = 5, а = 4/2. The 
design may be subdivided into three parts, 
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(i) The four points (—1, —1), (1, — 1), (— 1, 1), and (1, 1) constitute 
a, 2? factorial. 
(ii) The four points (—4/2, 0), (4/2, 0), (0, - 4/2), (0, 4/2) are the 
extra points included to form a central composite design with 
а = 4/2. The figure formed by these points is called a star. 
(iii) Five points are added at the center to give roughly equal preci- 
sion for 2 within a circle of radius 1. 


Rotatable designs for any number k of z-variables can be built up 
from these three components. The value of а must be 24/4 in order to 
make the design rotatable. Table 84.6 shows the components of the 


TABLE 8А.6 COMPONENTS OF CENTRAL COMPOSITE SECOND ORDER ROTATABLE 
DESIGNS 


No. of Number of points in 
z-variables | 25 fac- Total Value 
k torial Star Center N of a 
3 8 6 6 20 1.682 
4 16 8 7 31 2.000 
5 16 10 6 32 2.000 
6 32 12 9 53 2.378 


design for k = 3, 4, 5, 6. Note that with 5 and 6 z-variables, the size 
of the experiment is reduced by using a half-replicate of the 2* factorial. 
With а half-replicate, а becomes 2“*~)/*, Plans 8A.1 to 8A.3 at the 
end of this chapter give the designs for k = 4, 5, 6. The plans for k = 
2, 3 are presented in the following sections. These plans assume that 
the experiment is to be completely randomized. If the different treat- 
ment combinations are applied one after another, the order in this se- 
quence should be randomized. 


84.24 Statistical Analysis. The design for two z-variables is shown 
in table SA.7. The columns headed ті and хо, which specify the actual 
combinations to be used, constitute the plan for the experiment. Sub- 
sequent steps are as follows. 

1. Complete the columns headed zo, 21°, лә? and 2122 as shown. The 
two-way array, with 6 columns and 13 rows, comprises the X matrix 
of the z-variables. The corresponding values of the response y are 
placed on the right. 

2. Form the sum of products of each column in the X matrix with 
the column of y values. These sums of products are denoted by (Oy), 
(ly), (2y) and so on. 
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(It is not necessary to compute the X’X matrix of sums of squares 
and products of the z-variables. However, this matrix is shown in 
table 8A.7 in order to exhibit its structure, which remains the same for 
any value of Е. Note that the coefficients b;, b» of the linear terms and 
the coefficient b; of the cross-product term are orthogonal to all the 
others. The only coefficients which are entangled are bg and the coeffi- 
cients b11, b22 of the squared terms.) 

3. From the values of (Oy), (ly), ete., the regression coefficients are 
computed directly by the equations given at the foot of table 8A.7, The 


TABLE 8А.7 CENTRAL COMPOSITE ROTATABLE DESIGN IN TWO Z-VARIABLES 


X = Matrix of z-variables Y 
Design 
ey, 
т т 23 Ru р. түл y 

1 = -1 H 1 1 
1 1 -1 1 1 --1 
1 -1 1 1 1 -1 
1 1 1 1 1 1 
1 -1.414 0 2 0 0 
1 1.414 0 2 0 0 
1 0 —1.414 0 2 0 
1 0 1.414 0 2 0 
1 0 0 0 0 0 
1 0 0 0 0 0 
1 0 0 0 0 0 
1 0 0 0 0 0 
1 0 0 0 0 0 

хх ХУ 

13 0 0 8 8 0 (Oy) 

0 8 0 0 0 0 (1%) 

0 0 8 0 0 0 (2у) 

8 0 о 12 4 0 ау) 

8 0 0 4 12 0 (22у) 

0 0 0 0 0 4 (12у) 

Solutions 


bo = 0.2(0y) — 0.1 У (iiy); where Хай) = (11у) + (22y) 
b; = 0.125 (ty) 

bii = 0.125(tiy) + 0.01875 У) (iiy) - 0.1 (0y) 

bij = 0.25(17у) 


only auxiliary quantity needed is DX (у), found by adding the cross- 
products of all the squared terms with y. 
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4. In the analysis of variance, it is usually of interest to partition the 
sum of squares of the y’s into the contribution due to a first order (linear) 
equation, the additional contribution due to the second order (quadratic) 
terms, a ‘lack of fit’? component which measures the deviations of the re- 
sponses from the fitted surface, and finally a measure of the experimen- 
tal errors, obtained from the replicated points at the center. General 
formulae for the sums of squares are as follows: 


Sums of squares 


94. 
k 
First order terms: DX diy) k 
а 
k 
Second order terms: (0) + У Бб) + У bu(y) — G/N мар 
i=l i<j 
Lack of fit: Found by subtraction п — Ei» 
Experimental error: E (уш – 9? т-і 
N 
Total: E w – @/N т + т – 1 
и 


In these formulae G is the grand total, N the number of points, and 
the yi, represent the responses at the central points, with mean 91. 

The multiplying factors which give the regression coefficients (foot 
of table 8A.7) also provide their standard errors. For k — 2 


в.е. (b) = sV 0.125 = 0.354s 
в.е. (b) = sV 0.125 + 0.01875 = 0.3798 
в.е. (bj) = sV 0.25 ` = 0.58 


where в is the standard error per observation. Note that in the case of 
Б, two factors are added to give the multiplier for the standard error. 
These relations between the standard errors and the multipliers hold 
for all the solutions given in this book. 


8A.25 Numerical Example of the Analysis for 3 2-variables. The 
data come from a larger investigation by Moore et al. (8A.8) on the 
effects of the minor elements copper (Cu), molybdenum (Mo), and iron 
(Fe) on the growth of lettuce in water culture. The statistical analysis 
has been described by Hader et al. (8A.9). Table 8A.8 shows the design, 
the X matrix, and the lettuce yields y. 

A preliminary step in any experiment of this type is to set up the rela- 
tions between the coded z-scales and the original scales in which the 
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levels are recorded. For Cu and Mo, the experiment was intended to 
explore the range of concentrations between 0.0002 and 2 parts per mil- 
lion (ppm). In the design scale (table 8A.8), the lowest and highest 


TABLE 8A.8 CENTRAL COMPOSITE ROTATABLE DESIGN FOR k — 3 


X = Matrix of z-variables Y 
Design 
-----оұо-: 
20 Ty T2 13 ті z? 13 түгә 2113 тәтз y 
1 —1 —1 -1 1 1 1 1 1 1 16.44 
1 1 -1 -1 1 1 1 -і -1 1 12.50 
1 -1 A -1 1 1 1 —1 1 -1 16.10 
1 1 1 -1 1 1 1 1 —1 --1 6.92 
1 -l -1 1 1 1 1 1 -1 -1 14.90 
1 1 -1 1 1 1 1 —1 1 -1 7.83 
1 —1 1 1 1 1 1 -1 -1 1 19.90 
1 1 1 1 1 1 1 1 1 1 4.68 
1 -1.682 0 0 2880 0 0 0 0 17.65 
1 1.682 0 0 2.828 0 0 0 0 0 0.20 
1 0 —1.682 0 0 2.828 0 0 0 0 25.39 
1 0 1.682 0 0 2.828 0 0 0 0 18.16 
1 0 0 -1.682 0 0 2.828 0 0 0 7.87 
1 0 0 1.682 0 0 2.828 0 0 0 11.99 
1 0 0 0 01710 о10 0 0 0 22.22 
1 0 0 0 оо 0 0 0 0 19.49 
1 0 0 0 0, „рО, 0 0 0 22.76 
1 0 0 0 ооо 0 0 0 24.27 
1 0 0 0 ооо 0 0 0 27.88 
1 0 0 0 е 7-70 0 0 0 27.58 
824.18 
Solutions 


bo = 0.166338(0y) — 0.056791 У) (iy) 
b; = 0.073224(iy) 
bi; = 0.062500(iiy) + 0.006889 Y^ (iiy) — 0.056791(0y) 
Б; = 0.125000(tjy) 
values of z are —1.682 and 1.682. Rounding slightly, we take 
z — —1.68 when conc. — 0.0002; z — 1.68 when conc. — 2. 


Further, it was decided to express y as a quadratic function of the log 
concentration rather than of the concentration itself. Write 


x = а + blog (conc.) 


where a and b are chosen to satisfy the desired conditions at the ends of 
the scale. We find, for Cu and Mo, 
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x = 1.4272 + 0.84 log (сопе.) 


From this equation the concentration levels corresponding to x = —1, 
0, and 1 are determined. For Fe the range in concentration was from 
0.0025 to 25 ppm, and the equation is 


д = 0.5057 + 0.84 log (conc.) 


In the original experiment, the Fe concentration for х = —1 was set 
wrongly owing to a computational error, so that the actual design was 
not quite rotatable and required a more complex analysis. To avoid 
this complication, estimated yields for the correct concentration have 
been used here for the four observations that were affected by this error. 

The statistical analysis follows the procedure outlined in the previous 
section. Each column of the X matrix is multiplied in turn by the Y 
column, giving the sums of products (Oy), (10), etc., shown in table 
8A.9. The auxiliary total У (iiy) is computed, and the regression co- 


TABLE 8A.9 REGRESSION COEFFICIENTS AND ANALYSIS OF VARIANCE 


(Oy) 324.180 bo 24.0401 
(ly) —64.761 by —4.7420 
(2y) —16.281 b, —1.1885 
в) — 3121 b, 0.2285 
аш) 149.750 bu —5.4262 
(22у) 222.429 bs —0.8837 
(33у) 154.020 Ъз —5.1593 
(12у) —13.390 by -1.6738 
(1%) —9.170 bis -1.1462 
(23) 7.770 bs —0.9712 


У) (iiy) 526.199 


df. 8.8. m.s. 
First order terms 3 327.1 109.0 
Second order terms 6 760.3 126.7 
Lack of fit 5 40.1 8.0 
Error 5 52.6 10.5 
Total 19 1180.1 


efficients bo, b, ete. are found by the equations given in the solutions at 
the foot of table 8A.8. 

The sums of squares for the first and second order terms in the analy- 
sis of variance (foot of table 8A.9) are obtained from the general formulae 
in section 8A.24 as follows: 
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First order terms: 


k 
E b:(iy) = (—64.761)(— 4.7420) + (— 16.231) (—1.1885) 
F + (3.121) (0.2285) 

= 327.1 


Second order terms: 


k 
bo(0y) + 2 Бау) + 35 bay) — G/N 


іші i<j 
= (324.180) (24.0401) + (149.750) (— 5.4262) 
+++ (7.770)(—0.9712) — (324.18)2/20 
= 760.3 


The total sum of squares and the error sum of squares (from the 6 central 
points) are computed in the usual way. The sum of squares for lack of 
fit is obtained by subtraction. Since the mean square for lack of fit is 
about the same size as the error mean square, a second order surface 
appears to be adequate. Both the first and second order terms give 
significant mean squares. 


8A.26 Examination of the Fitted Surface. If there are only two z- 
variables, the interpretation of the results is facilitated by plotting the 
contours of equal response. То draw a given contour, j is set equal to 
the appropriate response level. For each of a set of values of 21, the 
fitted equation is solved as a quadratic in zs. This gives а series of 
points (21, тә) that lie on the contour. Alternatively, ў can be calcu- 
lated from the fitted equation for each of a grid of values of тү and 7; 
which cover the experimental region. The series of contours are then 
drawn in freehand. 

With three z-variables, a three-dimensional representation of the 
contours is illuminating. Choose one of the variates, say zs, as the 
“vertical” variate. For each of a series of equally spaced values of 3, 
draw the 21, x2 contours on transparent paper, using a separate sheet 
of paper for each value of хз. These sheets are then placed one above 
the other on a vertical frame, at heights above the base which repre- 
sent the values of z;. A fuller description of this type of model, with a 
photograph, is given by Box (8А.13). 

If there аге more than three x-variables, geometrical representation 
can be used only partially Тһе most helpful device is to transform the 
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response surface to its canonical form (see section 8A.55). The inter- 
pretation of the canonical form is discussed, with examples, in references 
(8A.10), (84.13), and (8А.14). Reference (8A.13) contains an experi- 
ment with five z-variables. 


8A.27 Designs in Incomplete Blocks. Arrangements of the central 
composite second order designs in incomplete blocks have been given 
by DeBaun (8A.18) and Box and Hunter (8A.7). The design for two 
z-variables (table 8A.10) shows the typical features. The 2* factorial, 


TABLE 8А.10 CENTRAL COMPOSITE SECOND ORDER DESIGNS IN INCOMPLETE 


BLOCKS 
2 z-variables N = 12 treatment combinations 
Block I Block IT 
ті та ті т 
-1 -1 -1.414 0 
1 =1 1.414 0 
=} 1 0 —1.414 
1 1 0 1.414 
0 0 0 0 
0 0 0 0 
Solutions 


bo = 0.25(0y) — 0.125 У) (iiy) 

b; = 0.125(iy) 

bii = 0.125(iiy) + 0.03125 У (iiy) — 0.125(0y) 
bij = 0.25(ijy) 


plus some central points, forms опе block, ог sometimes two blocks in 
the larger designs. The star part of the design, plus some central points, 
forms an additional block. Plans 8A.4 to 8A.7 give designs for 3 to 6 
x-variables. 

As usual, the blocking provides the opportunity for obtaining in- 
creased precision. If the treatment combinations are tested in sequence, 
blocking enables part of the effect of time trends to be eliminated from 
the experimental errors. In particular, if the initial experiment is a first 
order design, the 2% factorial should be set out as in the blocked plans. 
The augmented part required to fit a second order surface can be added 
later without worrying about whether time trends have changed the 
level of у. 

In the designs shown, blocks are orthogonal to the coefficients in the 
fitted quadratic surface. This simplifies the analysis. In order that 
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the orthogonality condition hold, the average value of ў must be the 
same in every block. In table 84.10, for instance, it may be verified 
that in both blocks I and II the average value of ў is 


bo + 201 + 26 


The orthogonality condition determines the value of a in the star part 
of the design. Thus, in table 8A.10, if the non-zero values of x; and zs 
are taken as —а and а in block IJ, the mean value of ў in this block 


works out as 
2 2 


DP ih 
0 etl eae 


In order to agree with the mean value of block I, we must take а? = 2, 
or a = 1.414. In some of the designs, the value of о required for ortho- 
gonality differs slightly from that required for rotatability, so that the 
rotatable feature must be sacrificed to some extent. 

The computation of the coefficients in the quadratic surface remains 
essentially the same as in section 8A.24. Write down the X matrix and 
the Y vector, ignoring the blocks. The sums of products (Oy), (1y), ete. 
are then formed and the regression coefficients are calculated from the 
equations given at the foot of each plan. 

In the analysis of variance, the sums of squares for the first order 
and second order terms are found by the formulae in section 8A.24. 
The blocks sum of squares is computed in the usual way as 


яңыса 

ЖЕЗ. 

Qi вы У 
where y; is the number of units in the ith block. (In some plans, vy; 
differs from block to block.) The sum of squares for experimental error 
is the pooled sum of squares between central points in the same block. 
In table 8A.10, for instance, this sum of squares has 2 d.f., one from 
each block. The sum of squares for lack of fit is found by subtraction. 


8A.3 Methods for Determining the Optimum Combination of Factor 
Levels 


8A.31 Nature of the Problem. In recent years, progress has been 
made on a problem of great importance in applied research. A response 
у is thought to be affected by a number of quantitative factors zi, 
T2 +++ ль. An experimental program is undertaken so as to discover the 
level at which each of these factors must be set in order to maximize 
the response. How should the program be conducted? This problem 
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ін common in research and developmental work in industry, where the 
investigator is seeking ways to manufacture a new product with certain 
desired characteristics, or to obtain an old product more economically. 
Examples are found in other fields also, as for instance the search for 
the best nutrient medium in which an organism can multiply in the 
biological laboratory. 

The response may be the actual amount of product or a measure of 
the quality of the product. Sometimes the object is to minimize y, as 
when y represents the amount of an undesirable by-product or the cost 
of manufacture per unit of output. The problem of locating a minimum 
does not require separate discussion, since a change in the sign of the 
response turns the problem into one of maximization. 

In, addition to locating the maximum of y, it is of value to learn 
something about how y varies, in the neighborhood of the maximum, 
when the levels of the factors are changed from their optimum values. 
This is so for several reasons. 

(i) In large-scale application of the results, it may not be feasible 
to set every factor at exactly its optimum level. Some combination of 
levels different from the optimum may be cheaper to maintain. 

(ii) Frequently, more than one characteristic of the product is impor- 
tant. A shift in the 2; away from the optimum may be preferable be- 
cause of its effect on one of these other characteristics. 

Gii) The shape of the response surface near the optimum may provide 
clues to the nature of the underlying process, as illustrated by Box and 
Voule (8A.10). 

(iv) The surface may not have a true maximum in the region of ex- 
perimentation (or in the region in which the process can be carried on in 
practice). In this event we want to know the nature of the surface in 
regions of relatively high response. 

The location of an optimum usually requires a coordinated series of 
experiments. In the following sections, some methods for planning the 
series are described. In practical applications, each method demands 
considerable exercise of judgment by the investigator, and the methods 
cannot be presented as a set of inflexible rules. Moreover, since the 
problem has been investigated only recently, future work may result in 
modification of the methods or in the development of new methods. 

At the outset, the experimenter must have decided which factors are 
to be included in the program. Sometimes there are initially as many 
as a dozen or more factors that might influence the response. The 
methods to be described have been applied mostly for numbers of fac- 
tors between two and six, and become cumbersome for larger numbers 
of factors. Some preliminary weeding out of factors that seem likely to 
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be of minor importance is therefore advisable before starting the methods 
described here. The range within which the level of each factor is to be 
varied must also have been selected. 


8A.4 The Single-factor Method 


8A.41 Description. A good account of this method has been given 
by Friedman and Savage (8A.11). The experimenter first makes an 
advance estimate of the optimum combination of factor levels, which 
will be denoted by т1, 21, *** Ты. Since each experiment deals with 
only one factor, the factors must be arranged in the order in which they 
are to be experimented upon. In general, a good plan is to start with 
the factor that is expected to give the largest response. 

In the first experiment all factors except the first are held constant 
at their initial levels 21, 31, “<< хы. The purpose of this experiment 
is to find the level of тү which maximizes у for this particular combina- 
tion of levels of the other factors. There are various ways in which 
this experiment can be carried out. In order to establish a maximum, 
at least three levels of z; must be compared. Four or five levels may be 
advisable if the range of zı is wide and the position of its optimum is 
poorly known; or perhaps an initial experiment with five widely spaced 
levels, followed by a second experiment with three levels at a narrower 
spacing. Hotelling (8A.12) has shown that equal spacing of the levels 
is not in general the most efficient procedure for locating the optimum. 
For experiments with three levels he has worked out the most efficient 
spacing in situations where the position of the optimum 2; is fairly well 
delimited in advance. 

When it is clear that the experimental points straddle a maximum of 
y, the position of the maximum may be estimated by fitting a parabola 
to the observed responses. 

If the equation of the fitted parabola is 


Y = bo + biz, + ізі? 


with b negative, the optimum z; is known from calculus to be 


Sometimes the value of тү that gives the highest observed y is close 
enough to the estimated optimum. The optimum, however estimated, 
will be denoted by 212. 

In the second experiment, тү is fixed at the level тіз and the values of 
©з +++ хь remain fixed at their initial levels. Several levels of the second 
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factor are compared in order to find the optimum level of £2, say 229. 
The third experiment deals similarly with тз, the other factors being 
held at the levels 212, 222, 241, *-* хы, respectively. The process con- 
tinues until an optimum has been estimated in turn for the k factors. 

This ends the first round of experiments, the estimated optimum 
factor combination at this point being the set of levels 212, 222, *** 232. 
If this new set of levels is close to the initial set ті, 221, *** хы, and if 
the values of y have improved little during the first round, the experi- 
menter may decide to terminate the experiments, concluding that no 
appreciable gain over the initial estimate of the optimum is possible. 
If more substantial changes have occurred, a second round of the whole 
process is undertaken, starting with fixed levels 225, £32, *** ть and 
testing several levels of ті to determine whether the position of the 
optimum has changed from the level 212. At the end of this round а 
new set of estimates 213, £23, *** Тұз has been obtained. The experimen- 
{ег then surveys the situation once again, and decides whether to ter- 
minate or continue to a third round. 

Several variations may be introduced into the method. At the end 
of the second round, Friedman and Savage suggest moving along the 
vector defined by the ends of the first two rounds. This means that at 
the beginning of the third round all z-variables are changed simul- 
taneously, the change in 2; being proportional to (дз — z;?). This sug- 
gestion, which departs from the “single-factor” approach, resembles the 
“steepest ascent” method to be described next. It may be decided to 
omit some of the factors in later rounds, on the grounds that they have 
shown no effects in earlier rounds. 

This method does not provide an estimate of the shape of the response 
curve. When the process has terminated, it will be advisable to conduct 
additional experiments for this purpose. 


8A.5 The Method of Steepest Ascent 


8A.51 The First Experiment. This method was proposed in 1951 by 
Box and Wilson (8A.6) and has been further developed and expounded 
by Box and others (8A.10, 8А.13, 8A.14, 8A.15). Asin the single-factor 
approach, the maximum is located by means of a series of experiments, 
each planned from the results of the preceding ones. The results of any 
experiment must become known quickly if the method is to be expedi- 
tious. Further, like the single-factor approach, the method works best 
when experimental errors are small and a previous estimate of the error 
variance is known, because under these circumstances the individual 
experiments in the series can be kept moderate in size. 
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All factors are included in the initial experiments. At the end of 
each experiment a nolynomial approximation to the response surface 
(zi, то, ++ ту) is fitted to the results and is used to determine the 
nature of the next experiment. The first and second order designs de- 
scribed in sections 8A.15 and 8A.2 were developed in connection with 
this method. 

The first experiment has two purposes. 

(i) To fit a linear equation 


9 = bo + biz + 6522 +-+ dere 


аз an approximation to ф in the vicinity of the starting point. 

(ii) To test whether the linear approximation fits within the limits of 
experimental errors. 

The 2% factorial or fractional factorial designs described in section 
8A.15 are useful for this purpose. Note that the experiment must pro- 
vide some d.f. to measure the lack of fit, and that an estimate of experi- 
mental error is required. Unless there is a trustworthy external esti- 
mate of experimental error, the experiment itself must supply this, 
either by the addition of points at the center as mentioned in section 
8A.15 or by replication of all points. 

The artificial data to which a linear equation was fitted in section 
8A.13 will serve as an example. There are three factors, the time dur- 
ing which a reaction is run, the temperature at which the reaction is 
maintained, and the amount C of a chemical substance. The best initial 
estimate of the optimum combination is, say, 


Time: 6 hours; Temperature: 300°; Amount of C: 1.6 gm. 


The next step is to decide the amount by which each of these factors 
will be varied in the first 23 experiment. A rough guiding rule is to 
change the level of a factor by an amount that (i) allows the effect of 
the factor a chance to show up, and (ii) is not large enough to cause 
doubts that the response will be curved in the interval between the two 
levels. Errors in these choices can be corrected by later experiments: 
a good choice means the optimum is attained more quickly. 

In the example, the investigator decides that the time of reaction can 
be changed by 1 hour, the temperature by 80°, and the amount of C 
by 0.4 gm. Taking the initial factor combination as the center of the 
set of levels to be tested, the first experiment consists of the 8 combina- 
tions of the following factor levels: 


ті Тіше 22 Temp. іі Ат. оС 
—1 5.5 hr. Т 260° —1 1.4 gm. 
+1 6.5 hr. +1 340° +1 1.8 gm. 
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Notice that coded levels —1 and +1 have been assigned to each of the 
three z-variables. In section 8A.13 the fitted linear equation in the 
coded scale was found to be 


9 = 170.2 + 11.82, — 5.25 + 4.023 


The experimental data did not provide any estimate of the experimental 
error variance. We shall suppose that a good external estimate of this 
variance was available, and that the mean square for lack of fit ap- 
peared to be well within the limits of experimental error. 


8A.52 The Direction of Steepest Ascent. When the first experiment 
has been completed, the region of experimentation is shifted to another 
set of levels of the 278. This set is to be chosen so that the maximum 
expected increase in y occurs. If the center of the first experiment is 
taken as the origin, the problem is to move from the origin, with x-coor- 
dinates (0, 0, - - - 0), to the position P, with coordinates (2^, z^», +++ 2^), 
so that the response $(2’;, z's, «++ т) is maximized. 

The change in ¢ clearly depends on the size of the jump that is made 
from 0 to the point Р, i.e., on whether we change the 278 by small or by 
large amounts. By geometrical analogy, the “distance” r from 0 to P 


is defined as 
r= Vial? + e uy (1) 


We suppose that the value of r has been chosen by the experimenter: 
the issues affecting this choice will be discussed later. For a given value 
of т, ie., a given size of jump, we determine the values of the =; that 
maximize $(x’;, 2/2, +++ 2’). The path of steepest ascent is the path 
from 0 to the point P whose coordinates are these maximizing т/. 

By calculus, the maximizing values of the z; are given by the equations 


where дф/дх; is the partial derivative of ф with respect to x;, taken аб 
the point P, and џ is a multiplier calculated so as to satisfy equation 


, ag\? 
(1). The actual value of и is r $ 3 E 
i 
In order to apply this method, we must estimate the partial deriva- 
tives 9ф/дх; from the results of the first experiment. This task is sim- 
plified if ф can be satisfactorily represented by a linear equation 
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$ = do + biti + bos Eod bite 


where фо is the value of ¢ at the origin. 
From the linear function, it follows that 


Thus the change in z; in the path of steepest ascent is proportional to 
b;, provided that the linear equation holds. 

The calculation of the path of steepest ascent is set out for the numer- 
ical example in table 8A.11. From the linear equation we have 


b, = +11.8; be = —5.2; з = +4.0 
in coded levels. 


TABLE 8A.11 CALCULATION OF PATH OF STEEPEST ASCENT 


ті та T3 
Time(hr) Temp. (°)  C(gm) 
(1) Relative change in design units = b; +11.8 —5.2 +4.0 
(2) No. of original units = 1 design unit 0.5 40.0 0.2 
(3) Relative change in original units +5.9 —208.0 +0.8 
(4) Change per 1 hr. change in Time 1.0 —35.3 +0.14 
Path of steepest ascent Predicted 
ў 
(5) Initial levels 6.0 300 1.60 
(6) Levels on path 7.0 265 1.74 201 
@ е и 8.0 229 1.88 232 
Отт, 9.0 194 2.02 263 


Row (1) shows the relative changes required in coded units. In rows 
(2) and (3), these are converted into relative changes in the original 
units. For Time, one coded unit represents 0.5 hr., so that 11.8 coded 
units represent 

(11.8)(0.5) = 5.9 hr. 


as in row (3). At this point it is convenient to select one of the variables 
as a standard, say Time, and to compute the changes in the other varia- 
bles that correspond to 1 hr. change in Time, as shown in row (4). 

To obtain the path of steepest ascent, start with the initial levels 
which formed the center of the first experiment. For each hour by 
which Time is increased, the Temperature should be decreased by 
35.3°, and the amount of С increased by 0.14 gm. Rows (6) to (8) 
show the factor combinations on the path of steepest ascent for Times 
of 7, 8, and 9 hr. 
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The expected yield ў at each point on the path of steepest ascent can 
be predicted from the linear equation, by expressing the z-levels on the 
path in coded units and substituting in the equation. These yields are 
shown in table 8A.11. 


8A.53 Subsequent Steps. The next step is to test a single factorial 
combination at some point on the path of steepest ascent, in order to 
discover whether the predicted increase in y actually takes place. The 
size of the jump that is made is a matter of judgment. In the example, 
a jump to the point (7, 265, 1.74) in the original units would probably 
be regarded as too small, since the first experiment contained a test at 
the point (6.5, 260, 1.8). The new test could be made either at the point 
(8, 229, 1.88) or at the point (9, 194, 2.02). The danger in making too 
large a jump is that it may pass beyond the optimum factor combina- 
tion. In cases of doubt it is best to choose the smaller of two proposed 
jumps, since at the worst this involves merely testing an extra factor 
combination that could have been omitted. 

If the actual yield in the new experiment is close to the predicted 
yield, a further jump is made along the same path. The process is con- 
tinued until the actual yield is substantially different from the predicted 
yield. 

At this point a new 2* factorial experiment is conducted, with its 
center at the last point on the path. If a linear equation is still a satis- 
factory fit to the new data, a new path of steepest ascent is calculated, 
and tests are made along this new path. 

In the course of time a situation is reached in which the 2% factorial 
gives one of the following results: 

G) The linear equation still appears to fit, but all coefficients b; are 
small. This suggests that we have reached a plateau. 

(ii) The lack of fit terms show that the linear approximation is in- 
adequate. This suggests that we are in a region in which the curvature 
of the surface must be taken into account. 

For further exploration of cases (i) and (ii), second order designs in 
which ¢ is approximated by a quadratic function of the 278 are used. 
Frequently the second order design can be constructed by adding addi- 
tional points to the last 2* factorial that was conducted, as in the ro- 
tatable second order designs (blocked or completely randomized) de- 
scribed in sections 8A.27 and 8A.23. Note that these designs provide 
a measure of the lack of fit of the quadratic surface and an estimate of 
the experimental error variance. 

Situations (i) and (ii) can, of course, arise in the first experiment. In 
fact, situation (i) is to be anticipated in the first experiment if the initial 
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estimate of the optimum is a good one and the optimum is flat. In this 
event we proceed at once to à design that permits the calculation of 
quadratie effects, without attempting to compute a path of steepest 
ascent from the first, experiment. 


8A.54 Correction for Errors in Level in the First Experiment. The 
importance of an appropriate choice of levels in the first experiment 
has been mentioned. Before discussing the interpretation of a quadratic 
response surface, it is worth noting how to correct for a poor choice of 
levels of a factor. 

Tf the levels of a factor are chosen too close together, the factor is 
likely to show a negligible or small effect in the first experiment, and the 
factor levels will be changed very slowly along the path of steepest 
ascent. Suppose that in table 8A.11 the levels of factor C in the first 
experiment had differed by 0.2 instead of by 0.4. The regression co- 
efficient ӛз in table 8A.11 would have been about +2.0 instead of 4-4.0. 
The relative change in хз in original units, in row (3) of this table, would 
have been (4-2.0)(0.1), ог +0.2 instead of 0.8. A halving of the inter- 
val between levels reduces the relative change along the path of steepest 
ascent to 14 of its original value. Thus too small an interval between 
levels may lead the experimenter to conclude that the factor is unimpor- 
tant and to omit it from consideration. 

When a factor shows little effect in the first experiment and it is 
suspected that the change in levels was too small, a much greater change 
in levels should be made in subsequent trials than the path of steepest 
ascent allows. If the factor actually produces no effect, this should be- 
come clear in later experiments, and the factor can then be dropped. 
Alternatively, а single confirmatory test can be run at the end of the 
first experiment, keeping the other factors fixed at their best levels in 
this experiment, and changing the level of the suspect factor by a sub- 
stantial amount. 

If the distance between the levels of a factor is too large in the first 
experiment, so that the response to this factor is markedly curved in 
the interval between the two levels, the failure of the simple linear equa- 
tion should be revealed by the lack of fit terms, with the consequence 
that the investigator proceeds quickly to experiments from which quad- 
ratic effects can be estimated. The effect on overall rate of progress 
should not be drastic. 


8A.55 Simplification of the Quadratic Surface. When the quadratic 
surface has been fitted, the position of the maximum value of ў, if one 
exists, is found by differentiating with respect to each z in turn. With 
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three factors, the surface is 
9 = bo + bızı + boo + bata + birti? + Doors? + basta” + biotite 


+ b;3z123 + beaters 


д 

ы = by + 201121 + Әз + bizza = 0 

Oxy 

9f 

— = be + diet: + 202272 + бззта = 0 

дтә 

ag 

rin bs + bisti + 92322 + 203313 = 0 
з 


Denote the solutions of these equations by 215, 225, 2зз, respectively. 
These give the factor combination for which ў has a maximum, or more 
generally a stationary, value. 

For further study of the response surface, some transformations of 
the z-variables simplify the task. The origin of the z-coordinates can 
be transferred to the stationary point by making the transformations 


di! wi — Tis; to! = Ta — tos) 2 = 13 — Tas 


With this substitution, the linear terms disappear from the quadratic 
expression, which becomes, for three factors, 


9 = йз + buti? + Dogs" + basta”? + алі та + Digi as! + злу ту 


where 
Ёз = bo + S(byzis + bares + batas) 


is the estimated stationary value of 2. 

The cross-product terms can also be made to vanish by transforming 
to new variables X; that are orthogonal linear functions of the z;. This 
transformation is familiar in both geometry and algebra. In geometry, 
the transformation amounts to rotating the coordinate axes so that the 
principal axes of the quadratic surface become the coordinate axes. In 
algebra, the transformation reduces the quadratic expression to its 
canonical form. Methods for making the transformation are described 
in (8A.14). 

In the new variables, the response surface becomes 


$ = fs + MX? + MX? + NX 


where the coefficients A; are functions of the 98, and the Х; are linear 
combinations of the 2. 
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8A.56 Interpretation and Further Experimentation. The next steps in 
the experimental program are determined by examining the form of the 
quadratic surface. From experience in the chemical industry, several 
different situations may be encountered. 

G) The A; are all negative. In this event the quadratic surface has 
а maximum at the point determined by the equations X, = Xz = Хз 
= 0. If this point is inside the region covered by the second order ex- 
periment, we have succeeded in locating a maximum. Further work 
will probably be confined to a confirmatory test at the estimated maxi- 
mum and perhaps some additional experiments designed to determine 
the function ф more precisely. 

A pietorial representation of the quadratic surface aids in interpreting 
the results. If there are only two factors, a series of contour lines can 
be drawn on graph paper as shown in Fig. 8A.3. A given contour shows 
all pairs of values of x; and хз for which the response has a specified 
value. When A and A; are negative, these contours are ellipses: the 
point S is the optimum combination of z; and хә. Figure 8A.3(a) 
shows a surface in which ^, and Аҙ are nearly equal: in Fig. 8A.3(b) № 
is much smaller than №. The limiting form of Fig. 8A.3(b), in which 
М» becomes zero, is shown in Fig. 8A.3(c). In this case the response 
remains the same as we move along any line X, — constant. In particu- 
lar, the maximum response can be obtained by any combination of val- 
ues of z; and т that makes X, = 0. Box has called this type of surface 
a stationary ridge. 

(ii) The A; are all negative, but the estimated maximum lies outside 
the region covered by the last experiment. In this event the estimated 
maximum may not be at all close to a true maximum, because the fitted 
quadratic can give very misleading results outside the region from 
which the data came. It is advisable to proceed cautiously in the gen- 
eral direction of the estimated maximum. If this path seems to lead 
toward a turning value, another experiment to which a new quadratic 
surface can be fitted is conducted further along the path. 

With two variables, the contqurs obtained from the first quadratic 
surface may appear as іп Fig. 8A.3(d) or (е). Case (е), in which A is 
near zero, has been called a rising ridge by Box, who recommends a move 
along the Хз axis in the direction in which the predicted response increases. 

(iii) Some à; are positive and some are negative. This turning value, 
illustrated for two factors in Fig. 8A.3(f), is called a minimax, being а 
minimum for some of the z-variables and а maximum for others. Should 
this case arise, the response is expected to increase most rapidly as we 
increase the X; that has the highest positive Xj. Factor combinations 
which give several increasing values of this X; may be tried. 
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Fig. 8A.3 Different types of contour surface. 


(iv) The quadratic surface does not fit. The best course of action 
varies from case to case. Addition of points in order that a cubic sur- 
face can be fitted may account for the discrepaney. In other cases it 
will be evident that no polynomial of low order provides a satisfactory 
approximation. Further experiments in the neighborhood of a point 
of relatively high response, with smaller intervals between the levels of 
the factors, may then be advisable. 


8A.6 Summary Comments 


8A.61 The Single-factor and Steepest Ascent Methods. On the 
score of simplicity the single-factor method has much to recommend it, 
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since it breaks the program down into a series of small experiments, 
The ideal situation for this method occurs when the response y has а 
single maximum and the factors are independent in their effects. The 
response curve, as we vary the levels of one of the factors, then has the 
same shape for any set of levels at which the other factors are fixed 
(although the position of the curve may depend on the levels of the other 
factors). It follows that the optimum can be reached at the end of a 
single round, apart from disturbances due to experimental errors and to 
the fact that a parabola may not be the exact shape of the response curve. 

When interactions among the effects of the factors are substantial, 
i.e., when the response surface is oriented as in Fig. 8A.3, (b)-(e), the 
single-factor approach tends to converge slowly and may require sey- 
eral rounds. With rising ridge surfaces as in Fig. 8A.3(e), Box (8A.13) 
has pointed out that the single-factor method may give the impression 
of having reached a maximum that is not the true maximum. 

The method of steepest ascent probes more thoroughly and is safer 
and more informative. It should frequently reach the optimum with 
less total experimentation, although it requires larger experiments and 
the method of analysis is more involved. 


8A.62 Use of a Single Large Experiment. The two previous methods 
require that the results of an experiment become known quickly, since 
each experiment awaits the completion and analysis of the previous one. 
In lines of work where an experiment must extend over a long period of 
time in order that the treatments produce their effects, the natural 
strategy is to try to discover the optimum combination at the end of a 
single experiment. Rotatable second order designs may be useful in this 
problem. 

If the region to be covered in the experiment is large, the response 
surface may be too complex to be approximated by a quadratic poly- 
nomial. Use of polynomials of higher order, or further exploration in & 
second experiment, as previously suggested, should be considered. 


8A.63 Selection of Test Points at Random. The possibility of using 
random combinations of the levels of the factors in exploratory work 
has been suggested, e.g., by Anderson (8A.16). If the factor 2; is to be 
studied between the levels 0 and 4, say, we choose a random number 
lying between these limits as the level of х;. An independent choice is 
made for each factor and for each successive factor combination that 
is tested. Alternatively, a number of equally spaced points, 0, 4, 1, 
1%, ---, 4 could be chosen, and a random choice of one of these levels 
made for each trial. At the end of n trials, the factor combination giv- 
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ing the highest observed response is regarded as an estimate of the 
optimum combination. 

Although this method has no planned strategy, it may have practical 
possibilities when the number of factors is large and experimental errors 
are small. Two goals which the experimenter might hope to accomplish 
at the end of a moderate series of preliminary trials are (i) to reach a 
region of reiatively high response and (ii) to learn something about the 
z-variables that have the greatest influence on y, in order that future 
experiments can be confined to a smaller number of factors. Suppose 
that the sub-region in the z-space in which the response is satisfactorily 
high has a volume which is a fraction a of the volume of the whole 
z-region that is being explored. If experimental error is ignored, the 
probability that at least one of the random combinations out of trials 
lies in the desirable sub-region is 

Р-1-(1-а) 
This equation holds irrespective of the number of factors. For instance, 
if a = 0.1, P exceeds 0.8 after 16 trials have been completed, 0.9 after 
22 trials, and 0.95 after 29 trials.* 

Since the levels of the different factors are independently assigned, 
methods of regression analysis can be applied to provide clues to the 
factors exerting most influence. This approach must be developed fur- 
ther and compared with other methods before its utility can be assayed. 


8A.64 An Experimental Comparison of the Four Methods. Direct 
comparisons of the performances of these methods in practical applica- 
tion are unlikely to be made for reasons of expense. A mathematical 
comparison which simulates a practical test has been conducted by 
Brooks (8A.17). He constructed four surfaces, each with well-defined 
maxima in two variables: typical contours are shown in Fig. 8А.4. For 
each surface, 9 different starting positions were tried. In his first series 
of artificial experiments, each method was allotted a total of 16 trials 
(1.е., 16 factor combinations) from which to locate-the optimum; in his 
second, 30 trials were allowed. A further variation was that the series 
were carried through both with no experimental error present and with 
a moderate amount of error. Several different methods of estimating 
the optimum for each method were investigated. 

The methods compared were the single-factor, the steepest ascent, 
the factorial, the selection of test points at random, and a stratified 
random selection of test points (not reported here). The factorial 
method represented the situation in which a single large experiment 


* S. Brooks, personal communication. 


368 SOME METHODS FOR STUDY OF RESPONSE SURFACES 84.64 


must be used. A 4 X 4 factorial was employed with 16 trials and а 
5 X 6 factorial with 30 trials. 


Typical contours of surface 
2 3 4 


Average performance on surface 


Method 1 2 3 4 
Single-factor .990 .984 .926 .984 
Steepest ascent 993 .989 979 .985 
Factorial 1955 1977 927 976 
Random .902 911 913 986 

Fia. 8A.4 Comparison of four methods of locating а maximum on 4 different 
response surfaces. 


The performance of any method was gauged by the true response 
$ti, 22) at the optimum values of zı and 2; as found by the method, 
The average results for each method over all trials are shown under 
Fig. 84.4, using for each method the estimating procedure that turned 
out to be best. 

Since the true maximum of each surface was 1, all methods did well. 
The steepest ascent method -was best on all four surfaces. The single- 
facto; method was almost as successful on all surfaces except surface 3. 
This is the type of surface for which опе of the Vs in the canonical form 
of the quadratic approximation is small, It frequently brought about a 
“rising ridge” situation in which the single-factor method made little 
progress after the first round. The factorial method was consistently 
poorer than the two sequential methods except on surface 3, where it 
performed about the same as the single-factor method. A common dif- 
бешу was that a polynomial surface (quadratic or cubic) did not fit 
the data satisfactorily because of the relatively wide spacing of the fac- 
torial grid. 

The use of random combinations of levels was distinctly inferior to 
the other methods, This method would not be expected to show prom- 
ise when there are only two z-variables, 

Needless to say, these conclusions may be altered if different types of 
response surface, different numbers of factors, or different amounts of 
experimental error are studied. For the surfaces covered, the results 
suggest an all-round reliability of the method of steepest ascent. 
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PLANS 
Central composite rotatable second order designs 
Plan 8A.1 4z-variable МУ = 31 treatment combinations 
24 factorial + star design + 7 points in the center 


zi 22 23 24 zi 2а 23 24 
-1 -1 -1 -1 -2 0 0 0 
1 -1 -l -1 2 0 0 0 
-1 1 —1 -1 0 -2 0 0 
1 1 -l -1 0 2 0 0 
= = ter gei 0 i'n 0 
1 -1 1 -1 0 0 2 0 
-1 1 1 -1 0 0 0 -2 
1 1 1 -1 0 0 0 2 
-1 -1 -1 1 0 0 0 0 
1 -1 -1 1 0 0 0 0 
-1 1 -1 1 0 0 0 0 
1 1 -1 1 0 0 0 0 
-1 -1 1 1 0 0 0 0 
1 -1 1 1 0 0 0 0 
-1 1 1 1 0 0 0 0 
1 1 1 1 
Solutions 


bo = 0.142857(0y) — 0.035714 } (iiy) 
b; = 0.041667(iy) 
bi; = 0.031250(iiy) + 0.003720 У) (iiy) — 0.035714(0y) 
bij = 0.0625(ijy) 
қ bá b; 
в.е. 0.2048 0.185 0.2508 
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PLANS 


32 treatment combinations 


N 
36 replicate of a 2° factorial + star design + 6 points in the center 


5 z-variables 


Plan 8А.2 


Tb 


74 


13 


та 


ті 


25 


T3 


z2 


со ее оси ое өе ә 


SOS еле LA SSS әз оше 


вот 


ЕЕ Е С 


nnoocooooocooooos 


Solutions 


bo = 0.159091(0y) — 0.034091 57 (iiy) 


b; = 0.041667(iy) 


bi; = 0.031250(iiy) + 0.002841 У (iiy) — 0.034091(0y) 


bij = 0.0625(ijy) 


bij 
0.250s 


bi 


bi 
0.2043 


0.1858 
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53 treatment combinations 


we 


6 z-variables 


Plan 8А.8 


36 replicate of а 2° factorial + star design + 9 points in the center 
р! 


T4 


T3 


T2 


ті 


22 з 44 25 16 


ті 


1 
1 
1 
—1 
1 
1 
0 
0 


оооооооосо 


с-сосооооооососоос 


Solutions 


bo = 0.110749(0y) — 0.018738 > (iiy) 


b; = 0.023087(iy) 


bi; = 0.015625(iiy) + 0.001217 У) (iiy) — 0.018738(0y) 


bij = 0.03125(ijy) 


bij 
0.1778 


bi 


0.130s 


bi 
0.152s 


в.е. 
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Central composite second order designs in incomplete blocks 
Plan 8А.4 3 z-variables М = 20 treatment combinations 
Block I Block IL Block III 
21 22 23 ті z2 та Ty т 13 
-1 -1 1 -1 -1 -1 —1.633 0 0 
P -f^ -f 1% -1 1 1.633 0 0 
—1 1 -і -1 1 1 0 —1.633 0 
1 1 1 1 1 -I 0 1.633 0 
0 0 0 0 0 0 0 0 —1.633 
0 0 0 0 0 0 0 0 1.633 
0 0 0 
0 0 0 
Solutions 
bo = 0.165385(0y) — 0.057692 У) (iiy) 
b; = 0.075(iy) 
bii = 0.070312(#)_+ 0.005409 У) (iiy) — 0.057692(0у) 
bij = 0.125 (iy) 
ы ы bij 
в.е. 0.2748 0.2758 0.3548 
Plan 8А.5 4 x-variables М = 30 treatment combinations 
Block I Block II Block Ш 
21 та тз 24 ті та 23 TA т та 13 TA 
-1 -1 -1 -l -1 -1 -1 1 -2 0 0 0 
1 =i =l Я 1 -1 -1 -1 2 0 0 0 
-1 1 -1 1 -1 19-1 -1 0 -2 0 0 
1 1 -1 =i 1 1 9-1 1 0 2 0 0 
-1 =l 1 1 -і -і 1 -1 0 0 -2 0 
1 -1 12-41 19 =l 1 1 0 0 2 0 
-1 1 1 -1 -1 if 1 1 0 0 0 -2 
1 1 1 1 1 1 1 =} 0 0 0 2 
0 0 0 0 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 0 0 0 0 0 0 
Solutions 


bo = 0.166667(0y) — 0.041667 Хам 

b; = 0.041667(iy) 

bii = 0.031250(iiy) + 0.005208 X (iiy) - 0.041667(0y) 
bij = 0.0625(0у) 


ы bi bi 
в.е. 0.2048 0.1918 0.2508 
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Plan 8А.6 5 z-variables М = 33 treatment combinations 
Block I Block П 

Ti 22 13 24 15 ті T2 тз 24 25 

-1 -1 -і 1 -2 0 0 0 0 
E ча E -Е 2,10! ©: 0- 0 

ӘЛ -12 Б =a 6-22: 0: о о 
1 IJ =t l 0 2 0 0 0 

Жі ope —1 0 0 -2 0 0 
1 ss} 1 0 0 2 0 0 

-і 1 1 0 0 0 -2 0 
1 1 —} 0 0 0 2 0 

7 | —1 0 0 0 0 -2 
1 =g 1 0 0 0 0 2 

—1 1 1 0 0 0 0 0 
1 1 -1 

-1 -1 1 Solutions 


_1 % = 0.137255(0y) - 0.029412 (йу) 

b; = 0.041667(iy) 

bi; = 0.031250(iiy) + 0.001838 >> (йу) 
— 0.029412(0y) 

bi; = 0.0625(ijy) 


= 
l 
- 
T pou 
оророооннны н-нын 


bis bij 
0.1823 0.2508 


о 
г L d 3 АК н ЖЕТА 
a кк нн н н 
- 


в.е. 
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PLANS 


N = 54 treatment combinations 


6 z-variables 


Block I 


Plan 84.7 


Block П 


ті 


26 


ть 


24 


Ta 


T2 


zi 


moan ooofo 


1 
1 
d 
1 
-1 
=l 
-1 
—1 
-1 
-1 
-1 
-1 


Block III 


Solutions 
bo = 0.099600 (0y) 


— 0.016892 2: (iiy) 


b; = 0.023148(iy) 
bi; = 0.015944(iiy) 


+ 0.000862 У) (iiy) 


— 0.016892(0y) 


bij = 0.03125(ijy) 


8% 
5 
eo 


{ооороооооо с оо 


a 


bij 
0.1778 


ba 


bi 
0.152s 


0.1308 


в.е. 
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Incomplete Block Designs 


9.1 Balanced Designs 


As their name implies, these designs, introduced by Yates (9.1), are 
arranged in blocks or groups that are smaller than a complete replica- 
tion, in order to eliminate heterogeneity to a greater extent than is pos- 
sible with randomized blocks and latin squares. In the designs de- 
scribed in chapters 6 to 8, this reduction in the size of block was achieved 
by sacrificing all or part of the information on certain treatment com- 
parisons. The present designs, on the other hand, were developed for 
experiments in plant breeding and selection, where it is desired to make 
all comparisons among pairs of treatments with equal precision, Con- 
sequently, a different method for reducing the size of block is employed. 

The designs may be arranged either in randomized incomplete blocks 
or in quasi-latin squares. They may be balanced or partially balanced. 
The balanced designs will be illustrated first by simple examples of the 
experimental plans. 

Consider the plan in table 9.1 which compares 9 treatments in in- 
complete blocks of 3 experimental units with 4 replications. 

Every pair of treatments will be found to occur once, and only once, 
іп the same block. For instance, treatment 1 occupies the same block 


TABLE 9.1 BALANCED DESIGN FOR 9 TREATMENTS IN BLOCKS OF 3 UNITS 


Block Rep. I Rep. II Rep. III Rep. IV 
@ 123 6 147 (715 9 (0) 1 8 6 
(2) 4 5 6 (5) 258 (8) 7 256 (1) 4 29 
37829 (0 3 6 9 9483 (12) 753 


With treatments 2 and З in the first replication, with treatments 4 and 7 

in the second replication, with treatments 5 and 9 in the third replica- 

tion, and with treatments 6 and 8 in the fourth replication. When the 

results are analyzed by the method of least squares, this property, to 
376 
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which the adjective balanced is applied, ensures that all pairs of treat- 
ments are compared with approximately the same precision, even though 
the differences among blocks may be large. This design belongs to the 
group known as balanced lattices, so-called because the plans are con- 
veniently written down by drawing a square lattice, with the treatment 
numbers at the intersections of the lines. Tri the balanced lattices, the 
number of treatments must be an exact square while the number of units 
per block is the corresponding square root. 

Balanced designs can be constructed for other numbers of treatments 
and of units per block. The plan in table 9.2 shows 7 treatments ar- 
ranged in blocks of 3 units. 


TABLE 9.2 BALANCED DESIGN FOR 7 TREATMENTS IN BLOCKS OF З UNITS 


Block 
(1) 124 (3) 3 4 6 (5) 1 5 6 () 187 
(2) 235 (4) 45 7 (6) 2 6 7 


Again every pair of treatments occurs once within some block. In 
this case, however, the blocks cannot be grouped in separate replications, 
since 7 is not divisible by 3. Designs of this type are known as balanced 
incomplete blocks. 

For certain numbers of treatments and units per block, both the types 
above can be laid out in a kind of latin square formation so as to allow 
the elimination of variation arising from two types of grouping. The 
appropriate rearrangement for the first example is shown in table 9.3. 


TABLE 9.8 BALANCED DESIGN FOR 9 TREATMENTS IN 4 LATTICE SQUARES 


Rep. I Rep. И Rep. III Rep. IV 
Columns 
Rows (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) 
() 19% (9:1. 477 (1.6 8 0071.9 5 
(2) 456 (5258 8924 (41156 «: 210 
з) 789 (0 з 6 9 9573 (2) 8 4 3 


It may be verified by inspection that every pair of treatments now 
occurs once in the same row and also once in the same column. All 
comparisons between pairs of treatments are of nearly equal precision. 
This design is known as a lattice square. 

The second example is rewritten somewhat differently. 

Every treatment now appears in each of the 3 rows, and every pair of 
treatments appears together once in the same column. Since the plar. 
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TABLE 9.4 BALANCED DESIGN FOR 7 TREATMENTS IN AN INCOMPLETE 
LATIN SQUARE 


Columns (Blocks) 
а) (2) (9 (9 (5) © (7) 


Rows 

(D: X 648 7 
(5 Ө дил. лар ы 
у: єй. Аш анд, Жау) Ce dug 


above could represent the first 3 rows of a 7 Х 7 latin square, this type 
of design has been called an incomplete latin square, or alternatively a 
Youden square, after W. J. Youden, who developed these designs for 
greenhouse experiments. 


9.2 Partially Balanced Designs 


Although a balanced design can be constructed with any number of 
treatments and any number of units per block, the minimum number of 
replications is fixed by these 2 variables. In most cases this number is 
too large for the usual conditions of experimentation, In order to allow 
more freedom of choice in the number of replicates, designs which lack 
the complete symmetry of the balanced designs must be used. 

Simple examples of such designs are the lattices. These are constructed 
in the same way as balanced lattices except that there are fewer replica- 
tions. The design with 2 replications (e.g., the first 2 replications in 
table 9.1) is called a simple lattice, and that with 3 replications a triple 
lattice. Similarly, with a lattice square, as in table 9.3, we may use less 
than the full number of replicates necessary for balance. For all these 
designs the number of treatments must be a perfect square. 

A further set, called cubic lattices, is useful when the number of treat- 
ments is very large. The number of treatments is the cube of the num- 
ber of units per block, so that a drastic reduction in block size is obtained. 
The number of replicates is 3 or some multiple of 3. 

Partially balanced designs are less suitable than balanced designs. 
The statistical analysis is more complicated. When the variation among 
blocks (or rows and columns) is large, some pairs of treatments are more 
precisely compared than others, and several different standard errors 
may have to be computed for tests of significance. These difficulties 
increase as the design departs more and more from the symmetry. of the 
balanced design. 
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The first comprehensive examination of the construction and proper- 
ties of designs that lack full balance was made by Bose and Nair (9.1a). 
They were particularly interested in developing designs for which the 
analysis should not be unduly difficult. For this to be so, the design 
must satisfy certain properties described in section 11.61a. Bose and 
Nair chose the name “partially balanced” for designs that satisfy these 
properties, although in the literature this phrase is sometimes found to 
denote any design that has some features of symmetry. 

The simplest of the partially balanced designs are those with two asso- 
ciate classes. In these, some pairs of treatments occur in the same block 
A, times, while other pairs occur in the same block № times, where № 
and № are whole numbers. In the lattice designs, which are of this 
iype, some pairs of treatments never occur together in a block, i.e., 
№ = 0; other pairs occur once in the same block (Хә = 1). 

А catalogue of 376 designs with two associate classes has been pub- 
lished by Bose, Clatworthy, and Shrikhande (9.22), including plans, in- 
structions for analysis, and worked examples. All these designs have a 
block size greater than 2 units. Instructions for obtaining 92 plans 
with block size 2 have been given by Clatworthy (9.32). These designs 
represent a great enlargement of the available selection of incomplete 
block arrangements. They should be particularly useful in experiments 
where the block size that permits precise comparisons is dictated by 
cireumstances beyond the control of the experimenter. 

Many of these designs can be arranged in a kind of latin square forma- 
tion in which variations among rows and columns can both be eliminated 
from the experimental errors. Table 9.1a shows an example for 6 treat- 


TABLE 9.1a A PARTIALLY BALANCED INCOMPLETE BLOCK DESIGN ARRANGED IN 
LATIN SQUARE FORMATION 


Columns (blocks) 

Rows (1) (2) (3) (4) (5) (6) 
(1) 1 2 3 4 5 6 
(2) 4 5 6 1 2 3 
(3) 2 3 1 5 6 4 
(4) 5 6 4 2 3 1 


ments in blocks of size 4. Each row forms a complete replication. With 
regard to the columns (blocks), & pair of treatments either occurs twice 
in the same block (e.g., treatments 1 and 2) or four times (e.g., treatments 
l апа 4). This type of design is appropriate in the same circumstances 
as the Youden square. 
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9.3 Basis of the Statistical Analysis 


9.31 Analysis without Recovery of Inter-block Information. То those 
who have previously used only the simplest designs, one feature of the 
incomplete block designs will be unfamiliar. The treatment averages 
must be adjusted in order to secure the full accuracy available in the de- 
sign. That this is necessary can be seen by examining, for example, the 
balanced lattice in table 9.1. In three of the 4 replications, treatments 
1 and 2 are in different blocks. Consequently, if the simple averages of 
the 2 treatments are compared, the effects of differences among blocks are 
eliminated only for the first replication. 

Yates developed two methods of analysis for the designs. The first 
is an analysis by standard least squares, reference (9.1), sometimes re- 
ferred to as the “intra-block” analysis. Later he showed that com- 
parisons among the block totals also contained information about treat= 
ment effects that can be utilized in the larger experiments. This ap- 
proach is analogous to that in split-plot experiments, where it will be re- 
called that use is made both of comparisons within whole-units (corre- 
sponding to incomplete blocks), and of comparisons among the totals of 
different whole-units. In this section the original method of analysis 
will be outlined for balanced lattice experiments. This method has not 
been entirely superseded by the newer analysis, because it must be used 
with small experiments. The newer analysis is sketched in section 9.32. 

For the original analysis the mathematical model and the assumptions 
are essentially the same as those for previous designs in this book, as 
described in section 3.2. In the balanced lattice there аге k? treatments 
in blocks of k units, with (k + 1) replications. Let y;;, be the observa- 
tion for the gth treatment, which we suppose to be in the jth block within 
the ith replication. The model is 


Уйа = в + met Bij + т. + ёа (9.1) 


where и, ту, Bij, and т, represent the effects of the mean, the replicate, 
the incomplete block, гапа the treatment, respectively, and е;; is the 
intra-block residual or error, assumed to be normally and independently 
distributed with mean zero and variance oe. As with previous designs, 
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we find estimates of the parameters by minimizing 
У (Wija =т= pi — bij — te)? 
subject to the usual relations 
> pi = 0, > 8-0 (foanyi, Хі-0 (9.2) 
4 


The normal equation for any unknown is obtained as before Бу equat- 
ing the observed total to the expected total over all units whose equation 
contains the constant. Thus, if T, denotes the total for all (k + 1) 
units that receive the qth treatment, the normal equation for tg is 


T, = (k-- 1)m + 3 + (k + 0% (9.3) 
the terms in the р; having disappeared in virtue of equations (9.2), since 
any treatment appears in all replicates. 

This equation does not at once give the value of tą, since it still contains 
the unknowns m and b,;. From the normal equation for m, it is easy to 
see that m is the average over the whole experiment. The normal equa- 
tion for any 5;; is 

Ву = km + kpi + kb + Хы (9.4) 
where Ву; is the observed block total, and the treatments sum is over the 


k treatments that are in the block. If we add these equations for all 
blocks that contain the gth treatment, we obtain 


By = МЕ + 1)т + kY bg 4-22 22 (9.5) 


where B, is the total of all such blocks. 

It is at this point that the structure of the design is important. In 
the blocks that contain tg, every other treatment occurs once and only 
once, while tg appears ( + 1) times. Hence 
EE = (k + Vlg + Х (all other йв) = kt, + У (all s) = № 
from (9.2). 

This simplifies (9.5) to 

B, = k(k + 1)m + kY bij + kta (9.6) 
We may now eliminate the unwanted term Y bj; from (9.3) and (9.6), 
giving 
kT, — B, = Pty (9.7) 
a relatively simple solution. In effect, we simply adjust the treatment 
total for the total of the blocks in which it appears. 
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Since f, represents the deviation of the estimated treatment mean from 
the mean for all treatments, it is customary in practice to compute 
(4 + m), which represents the estimated mean itself, A little algebraic 
manipulation from (9.7) shows that 

Т, + [kT ғ (k Ф 1)В, als G] 
kl E + 1) 
where G = k7(k + 1)m is the grand total for the whole experiment. 
Note that 7,/(k + 1) is the simple or unadjusted treatment mean. The 
last term on the right therefore represents the adjustment that is applied 
to the ordinary mean. This result will be used in a later comparison with 
the newer analysis. 

A continuation of the analysis shows that the variance of the difference 
between two t, values is 2c,2/k. If the experiment had been in ran- 
domized blocks, with the same number of replications, (k + 1), the cor- 
responding variance would have been 207/(k + 1), where c? is the error 
variance for randomized blocks. Hence the new design, with this method 
of analysis, gives a more accurate experiment than randomized blocks if 
and only if 


и +т = (9.8) 


с? k 
— <-— 
` k+l 


The quantity k/(k + 1) is called the efficiency factor of the design. Note 
that the incomplete block design would be less accurate than randomized 
blocks if the variation among incomplete blocks were as great as that 
within complete replications. With the newer method of analysis this 
disadvantage is largely removed. 


9.32 Analysis with Recovery of Inter-block Information. This analysis 
rests on more difficult theory and will be outlined only in part. Formally, 
the mathematical model is exactly the same: 

Ум = и + Ti + Ta + В; + еда (9.9) 
However, the additional assumptions are made that the block effects 
Bi; are normally and independently distributed with zero Means and 
variance съ”, and that they are independent of the е; It follows from 
the assumptions that for the difference between two observations in the 
same block, the residual variance is 20.2, since the 6;/8 cancel, whereas 
for the difference between two observations in different blocks, the 
residual variance is 2(о,2 + 02). The reader will note the analogy with 
split-plot experiments (section 7.12). With regard to the justification 
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for the additional assumptions, it can be said that in agricultural field ex- 
periments, for which the designs were first developed, it is usually just 
as reasonable to make the assumptions for the 0;; as for the еж. The 
assumptions cannot be taken for granted, and with certain types of data 
we might not wish to make them. 

As a result of the assumptions, observations that are in the same block 
are positively correlated, so that the simplest type of least squares 
estimation cannot be applied. Instead, we may write down the joint 
frequency distribution of all the observations, and use the more general 
method of estimation known as maximum likelihood. By a well-known 
device in theory, maximum likelihood estimation may be shown to be 
equivalent to the minimization of a weighted sum of squares, consisting 
of two parts. The first part is the sum of squares of deviations of the 
residuals from their block means; the second is the sum of squares of the 
residuals of the block totals. The two parts receive weights w and w'/k, 
respectively, where 

TA 3 1 
i $23; 5 аг + Ко? 
For the moment we suppose that w and w’ are known. For the balanced 
lattice, the quantity to be minimized is 


E шури из а — ty) P+ = [Bij ат — kpi — НУР 
(9.10) 


where y;;.,t;;, denote, respectively, the observed mean of a block and the 
mean of the Рз that occur in the block. The first sum is over all observa- 
tions, the second over all blocks. 

When we differentiate with respect to a given t,, we must note that the 
term &;. will contain £j, with coefficient 1/k, whenever the block in ques- 
tion contains the gth treatment. The derivative, when equated to zero 
(omitting a factor —2), gives 


wu 
У ия — Vij — ta + tj] + 2y Ba — km — kp; — kt] = 0 
(9.11) 


The first sum is over the observations that receive the qth treatment; 
the second over the blocks that contain this treatment. The derivative 
contains an additional part arising from the terms іп &;. in the first 
bracket of (9.10), but this contribution will be found to vanish. 

The next step is to simplify (9.11). Both brackets involve a sum of the 
terms ¢,;.- In each case, because of the symmetry of the balanced lattice, 
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the sum contains all other 23 once, and & (k + 1) times. Hence 


(Е + Ты + (rest of t’s) — kt, 
жаы Y -3 
Xu : и 
since the total of all sis zero. Further, we may note that the observed 
block mean y;;, equals B;;/k. Writing as before Т, for the treatment 
total and B, for the total of all blocks in which the treatment appears, we 
have from (9.11) 


B, w' 
v[n.- 2- (Е + Dig + ta] e t — ke 1)m — kt] = 0 
(9.12) 
or 


B 
(kw + w')ty + (k + Пит = wT, — (w — ш) т (9.13) 


The constant m may be shown to be the mean for the whole experiment, 
or G/k?(k + 1). After some rearrangement we may express (9.13) in a 
form comparable with the original estimate given in (9.8), as follows. 


Тұ (w — w^)[kT, — (k + 1)В,-+ С] 
k+1 k(k + 1)(kw + ш”) 


The adjustments are seen to be of the same form in the new as in the 
original analysis. The two are equal when w’ is zero. This occurs when 
с? is very large; that is, when differences among blocks are great. In 
all other cases the adjustments are smaller with the new than with the 
original analysis. They reduce to zero when w' = w, which happens 
only if c;? is zero, or in other words if there are no real differences among 
blocks. Thus the new analysis is a generalization of the original anal- 
ysis, reducing to it in the extreme case where there are marked variations 
among incomplete blocks. At the other extreme, when the arrangement 
into blocks has been ineffective, the new analysis makes no adjustments 
for the non-existent block differences, and in fact reduces to an analysis 
by the method for randomized blocks. 

In practice the weights w and w’ are not known. Yates has shown 
that they can be estimated from the analysis of variance, in the same way 
as we estimate two separate errors in a split-plot experiment. The de- 
tails will not be given. The fact that weights are estimated rather than 
exact introduces some additional sampling variation into the adjust- 
ments. This is unimportant in the larger experiments, but in certain 
of the smaller experiments the weights cannot be estimated accurately 
and the original analysis is recommended. For other accounts of the 


(9.14) 


ит = 
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analysis by different approaches, see references (9.2), (9.3), (9.4), (9.5), 
and (9.4a). 


9.4 Comparison of Incomplete Block and Randomized Block Designs 


So far as the experimental operations are concerned, incomplete block 
designs are no more difficult than randomized blocks. Some extra 
planning is involved in drawing up and randomizing the experimental 
plan, especially if care is taken to make the best possible grouping of the 
experimental units. According to the computing equipment and ex- 
perience available, the time required for the statistical analysis may ex- 
ceed that for randomized blocks by 20 to 150%. 

The gain in accuracy over randomized blocks depends on the type of 
experimental material and may be expected to increase as the number of 
treatments is increased. Most of the incomplete block designs cover the 
range from 6 to 200 treatments, while the cubic lattices extend this range 
to 1000 treatments without requiring a large incomplete block. The 
number of treatments for which a substantial increase in accuracy is at- 
tained must be determined by experience. If the experimental material 
is highly variable and yet lends itself to the formation of small groups 
which are homogeneous, the designs may be advantageous even with 
small numbers of treatments. From the results of varietal trials a num- 
ber of comparisons with randomized blocks have been made: see, for ex- 
ample, references (9.6), (9.7), and (9.8). The experiments varied in size 
from the 3 X 3 to the 11 X 11, and included both lattices and lattice 
squares. They indicated an average gain in accuracy of the order of 
25%. This means that 4 replications of an incomplete block design were 
about as accurate as 5 replications of randomized blocks. 

The ease with which the number of replications can be increased is 
also а factor. The object in the new designs is to obtain the most ac- 
curate comparisons that are possible from a given number of experi- 
mental units. Accordingly, the designs are likely to be most helpful when 
the amount of experimental material or considerations of cost and labor 
force the experiment to be smaller than is desired. Where the number of 
replicates can be increased without difficulty, the experimenter may pre- 
fer some extra replication of a simpler design which avoids the caleula- 
tion of adjustments. 

There is one important property, possessed by many of the designs, 
which increases their attractiveness relative to randomized blocks. Аз 
the plans show, the lattice square and the lattice designs are arranged 
in complete replications as well as in incomplete blocks. A few of the 
balanced incomplete block designs can also be grouped into replications. 
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Such designs may be regarded as randomized block designs which have 
additional restrictions within each replication. It has been shown by 
Yates (9.2) that these designs can be analyzed as if they were ordinary ran- 
domized blocks. This implies that the unadjusted treatment means give 
unbiased estimates of the true treatment effects, and that the Ё- and t- 
tests do not lose their validity. Of course, this analysis will in general 
be less accurate than the complete analysis. 

The question of the validity of t-tests requires discussion. If real 
block differences exist, the true error of the difference between two un- 
adjusted treatment means is smaller for a pair of treatments that occur 
in the same block than for a pair that do not. Hence the randomized 
blocks analysis, which ascribes the same standard error to the difference 
between any pair of treatments, will overestimate the true error for 
some pairs and underestimate it for others. Yates (9.2) shows, however, 
that if an ordinary randomized blocks design is superimposed on the 
same experimental site with the same block effects, the true error of the 
difference still varies from one pair of treatments to another, because 
the randomization places some pairs of treatments in the same block 
and others not. This variation in accuracy appears to be of the same 
order of magnitude for an incomplete block design as for a randomized 
block design. This argument can break down for certain types of com- 
parison. For instance, if we happened to compare the mean of all 
treatments that occur in one block with the mean of all those that occur 
in another block, the standard error of the difference could be substan- 
tially larger than the value obtained from the randomized blocks analysis. 

Thus, if there is any criterion for forming incomplete blocks, an in- 
complete block design is worth a trial in preference to a randomized 
block design which occupies the same set of replications. When the data 
have been collected, the experimenter may choose whether to analyze 
them as randomized blocks or to complete the full analysis, with the 
adjustments for incomplete block variations. In fact, if the variation 
among incomplete blocks is no greater than that within blocks, the com- 
plete statistical analysis reduces automatically to that for randomized 
blocks, as mentioned in section 9.32. 

The analysis as randomized blocks may be useful in experiments where 
several measurements ағе made on each experimental unit. In the for- 
mation of incomplete blocks the units are usually grouped with regard 
to the most important measurement. For certain other measurements 
from the same experiment, the grouping may be less effective; in such 
cases the “randomized blocks" analysis will be satisfactory, as also with 
measurements of subsidiary interest, where the greatest attainable ac- 
curacy is not required, 
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In some types of research an appreciable number of units are likely to 
be injured or destroyed in the course of the experiment, so that they must 
be omitted from the statistical analysis. With incomplete data (except 
where only a few units are missing) laborious computations are required 
to calculate the block (or row and column) adjustments. Consequently, 
in experiments where missing data are of frequent occurrence, incom- 
plete block designs cannot be used to full advantage. Even in this case 
nothing is lost by using an incomplete block design which can be ar- 
ranged in complete replicates. If when the experiment is completed it 
becomes evident that an appreciable number of units must be discarded, 
the experimenter may use the randomized blocks analysis in which the 
extra complication due to missing data is smaller. The same con- 
siderations apply in cases where certain treatments may have to be 
ignored in the final analysis. 


9.5 Comparisons with Other Designs 


Yates (9.1) has discussed three other designs that have been used 
when the number of treatments is large. 


9.51 Systematic Controls. In agricultural field experiments, a control 
variety is sometimes placed at regular intervals over the site, the ex- 
perimental varieties being arranged in complete blocks. From the yields 
of the control plots a fertility index may be calculated for every plot in 
the experiment. These indices are used to adjust the experimental plot 
yields for local variations in fertility. 

The method of systematic controls is very flexible, since it can be used 
with any number of treatments and any number ot replicates. Little 
evidence is available about the increase in accuracy obtained from the 
controls, though it seems probable that the increase is seldom large if the 
extra space occupied by the controls is taken into account. The calcula- 
tion of the best adjustments, as described by Yates (9.1), is rather tedi- 
ous, but if a crude type of adjustment is made most of the potential ad- 
vantages of the method may be lost. 

There may be additional reasons for the presence of extra controls, 
e.g., for their use in an observational scoring of the experimental ma- 
terial. In this connection it should be noted that extra controls can be 


included in an incomplete block design. 


9.52 Random Controls. In another method the treatments are di- 
vided into groups, the grouping remaining the same in all replicates. 
Each group is arranged in a separate randomized block or latin square 
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experiment. In order to obtain comparisons between treatments that 
are in different groups, one or more controls are included in each group 
and randomized along with the other treatments. Before comparing 
treatments that are in different groups, we subtract from each treat- 
ment mean the mean of the controls that are in the same group. Thus 
the controls serve to correct for differences in the fertility of the sites on 
which different groups are tested. 

This design allows considerable flexibility in number of treatments and 
amount of replication and is simple to analyze. From theoretical con- 
siderations it is likely to be inferior in accuracy to a comparable in- 
complete block design, if one exists. Moreover the error variance is not 
the same for all types of comparison. If only one control is included, 
the variance is twice as great for the difference between two treatments 
that are in different groups as for two treatments in the same group. 


9.53 Split-plot Designs. This arrangement is a variant of the previous 
design which avoids extra controls. As in section 9.52, the treatments are 
first divided into а number of groups of equal sizes. Instead of testing 
each group on a separate site, the groups are combined into a single ex- 
periment of the split-plot type. For example, with 25 treatments and 
5 replications, we might first divide the treatments into 5 groups, А, В, 
C, D, and E. The groups are now regarded as whole-plot treatments, 
and could be arranged in a 5 X 5 latin square on plots 5 times as large as 
the basic plots. Within each group the treatments are arranged at ran- 
dom on the individual plots. 

From the analysis for a split-plot design (section 7.15), it will be seen 
that we can test the difference between two treatments whether they 
are in the same group (i.e., whole-plot) or in different groups. Controls 
are no longer needed. The advantages and disadvantages of this ar- 
rangement are in general similar to those of the “random controls": the 
Split-plot design is more accurate if the space that would have been al- 
lotted to the controls is utilized. Sometimes the split-plot design is 
really more appropriate than an incomplete block design because the 
treatments divide themselves naturally into groups, and comparisons 
between members of the same group are regarded as more important 
than comparisons between members of different groups. 


9.6 Choice of Incomplete Block Design 


Е Tables 9.5 (randomized incomplete blocks), 9.6 (lattice squares and 
incomplete and extended incomplete latin squares), and 9.2a (partially 
balanced incomplete block designs) form an index to the incomplete 
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block designs. Each table is arranged by number of treatments (t) 
and number of units per block (k) and shows the numbers of replications 
for which designs are available, with references to the plans. It is hoped 
that the tables provide a rapid means of locating a suitable design, if one 
has been constructed. 

Where the number of treatments and the size of block are fixed in ad- 
vance by the conditions of the experiment, little choice is available to 
the experimenter. In certain types of research, however, both the num- 
ber of treatments and the size of block (or row and column) can be varied 
to some extent without impairing the experiment. 

Where more than one design appears appropriate, a design which can 
be arranged in separate replicates is preferable to one which cannot, and 
a balanced design is preferable to a partially balanced design. These 
recommendations usually narrow the choice to one or two designs. For 
example, with 25 treatments to be compared in 8 replications, we might 
use (see table 9.5) either a balanced incomplete block design in blocks of 
4 units, or a lattice design (partially balanced) in blocks of 5 units. The 
latter design, though not the former, can be laid out in separate repli- 
cates. 

The relative advantages of lattice and lattice square designs can be 
learned only by experience. From the results of a lattice square experi- 
ment we may estimate what the standard error would have been if 
either of the two types of grouping had not been used. Consequently the 
experimenter may test additional methods of grouping by using lattice 
squares. If the extra grouping turns out to be ineffective, the accuracy 
is only slightly less than that of the corresponding lattice design. The 
statistical computations are, however, more laborious for lattice squares, 
since correction terms must be obtained for both row and column effects. 
In general, the lattice squares may be expected to be successful in types 
of experimentation where the latin square has been found superior to 
randomized blocks. 
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TABLE 9.5 DESIGNS ARRANGED IN RANDOMIZED INCOMPLETE BLOCKS 
N Num-| Number of .|Num-| Number of 
UE ber of | replications, Plan ber of| replications, Plan 
T 9! | units P number units pt number 
treat- P or 
МР рег ог 
mend block, hapter 1 block, chapter f 
k | p.b.t | Ба k p.b.f | bal.t 
4 2 3 11.1 4 |2... 5 10.2 
3 3 TH 6 6 11.27 
5 2 4 11.2 6 9 11.28 
3 6 11.14 10 10 11.29 
4 4 11. 3 9 11.30 
6 2 5 11.3 9 9 11.31 
3 5 11.4 10 10 11.32 
3 10 11.5 4 |2,3 10.11 
4 10 11.6 3 10 11.33 
5 5 41, 5 5 11,34 
7 2 6 11.24 7 10 11.35 
3 3 11.7 4 8 11.36 
4 4 11.8 5 |2.. 6 10.3 ( 
6 6 11. 9 9 11.37 
8 2 7 11.9 8 |3 10. 
4 7 11.10 4 9 11.38 
7 7 11. 7 9 11.39 
9 2 8 11.34 5 |2,3 10.12 
$71.28 4 | 101 6 6 | 1140 
4 8 11.11 10 10 11.41 
5 10 11.12 6 |2,3 10.7 
6 8 11.13 9 9 11.42 
8 8 11. 5 10 11.43 
10 2 9 11.14 6 |2,8 10.13 
3 9 11.15 7 |2... 8 10.4 
4 6 | 1116 7.|2,3 10.14 
5 9 11.17 8 8 11.44 
6 9 11.18 4 |3 10. 
9 9 11. 8 |2... 9 10.5 
11 2 10 11.4а 8 |2,3 10.15 
5 5 11.19 9 9 11.45 
6 6 11.20 9 |2... 10 10.6 
10 10 | 11. 9 |2,3 1016 
12 3 2,3 10.10 10 10 11.46 
13 3 6 | 12.21 10 |2,3 10.8 
4 4 11.22 11 |2... 10.(12.7) 
9 9 11.23 5 |3 10. 
15 3 7 | 1124 12 |2,3,4 10.9 
5 7 | 1125 18. | 2-:- 10.(12.8) 
8 8 11.26 


* Ог any multiple of this number. 

tp.b. = partially balanced designs, 
bal. = balanced designs, 

1 References 10., 11., denote chapters, 
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TABLE 9.6 DESIGNS ARRANGED IN LATTICE SQUARES AND INCOMPLETE AND 
EXTENDED INCOMPLETE LATIN SQUARES 


Number] Number of Number] Number of 
Number | of units| replications, Number | of units | replications, 
of treat- per TU Plan of treat- per re Plan 
ments, | со.ай| | number ments, | col. and number 
t row, t row, 
в p.b.¢ | balt k р.Ь. | balt 
3 6,3 6 |218 9 9,4 8 | 13.10% 
9,3 9 |313 9,5 10 | 13.11a 
44 | 5+ 13R+C+ 8,8 | 7+ 13R-C- 
3,5 5 13.16 9,8 8 131 
3,7 7 13.17 8,10 | 9— 13R-C+ 
3,8 8 13.18 10 10,3 9 13.12% 
3,10 | 10 13.19 9,9 8+ 13Һ-С- 
4 4,8 (3) $,6 | 13$ 10, 9 9 13$ 
8,4 8 2L8 9,11 | 10- 13R-C4 
3,5 4- 13R—C+ n 11,2 10 |13134 
4, 5 5 13.20 11,5 5 |133 
5,5 6+ 13R+C+ 11,6 6 13.4 
4,7 7 13.21 10, 10 9+ 18Һ-С- 
4,9 9 13.22 11,10 10 13$ 
5 5,2 4 13.68 12 1111 | 10+ 13R-C— 
5,3 6 |187а 13 13,3 6 [13.148 
4,4 3+ 13R-C- 13,4 4 |135 
5,4 4 |13} 13,9 9 |136 
10,5 10 |213 15 15,7 7 |137 
4,6 5- 13Һ-С- 15,8 8 13.8 
5, 6 6 13.23 16 4,4 5 |122 
6,6 7+ 13R+C+ 16,6 6 13.9 
5,9 9 13.24 16, 10 10 13.10 
6 5,5 4+ 13Һ-С- 19 19,3 9 13.15% 
6, 5 5 |13} 19,9 9 | 13.11 
5,7 6- 13R-C4- 19, 10 10 13.12 
6,7 7 13.25 21 21,5 5 [13.13 
7,7 8+ 13R+C+ 25 | 25,4 8 | 13.168 
7 7,2 6 13.88 5,5 3 12.3 
7,3 3 13.1 25,9 9 13.18 
7,4 4 13.2 31 31,6 6 13.14 
6,6 5+ 13R-C- 31,10 10 |132a 
7,6 6 131 37 37,9 9 13.15 
68 | 7- 13R-—C+ 41 41,5 10 | 13.178 
7,8 8 13:26 49 7,7 3 4 |124 
8,8 9+ 13R+C+ 57 57,8 8 13.3а 
8 7,7 6+ 13R-C— 64 8,8 |3 9 |125 
7,8 7 | 13t 73 | 73,9 9 | 1348 
7,9 8- 13R—C+ 81 9,9 3,4 5 12.6 
9,9 | 10+ 13R+C+ 91 | 91,10 10 | 13.50 
9 9,2 8 | 13.9 121 11,11 |3 в |127 
3,3 (0) 6,4 | 12.1 169 | 13,13 |3 128 
+ = One treatment has one more replicate. * Or any multiple of this number. 
— = One treatment has one less replicate. f p.b. = partially balanced designs. 
R+C+ means one row and one col. added to bal. = balanced designs. E 
at X Llatin square. 3 Constructed from a t X t latin square by 
R—C-+ means one row omitted and one col. omission of last col. 
added. ( ) $ Not enough degrees of freedom for error, 


R—O-— means one row and one col. omitted. 
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TABLE 9.2a PARTIALLY BALANCED INCOMPLETE RLOCK DESIGNS 


Ret- 
cr 
E 
5 з |n. [cioe 5| 8 | 112a | sris ма[в 
° сі” 5 | 10 | zs | srio* | 4 
6| 3 |112| 715° a| 6 
6| 2| 2 | ила |8! e| 6112s | т, Tis ЛЕ 
3| 3 па | в * 6| 6 1122 |sio* e| в 
3| а (а | sre 6| 6 1160 | R132 7| 4 
з | в | 11.20 | 583, ra* 7| 7 пм то" 7| 4 
3] 7 [11 Ви, Ras 8| а [иза |sis* 7| 7 
3/8 8| 8 | 112 | sae 7| в 
4 2 |0 
ti es mI 3| 4 1122 | ЕНІ $4 1 
ДЕ АЯ 542 3| 5 | 1120] R16 B d 
doro Me 3| 5 114a | 1.5 4 
3| в | па | Rize el als 
3| 7| п | Ris 
12а | в5* 4 
8 j ; dx D. з | 7 | 11.60 | Ros ЖЕ; 
ора 3| 8 | 11.2a | rio AR 
dim | onim ИНЕ 
2 . Ы 
tt Clas рси 4| 3 |1124 |8820 Alt ud 
4 | 10 | 112a | ВИ |жана nm 
5| 5 | 11.6a | R108, R109 Е d i| 8 
ДІНДЕ ЕБ a | 6 | 11.20 | snos HE: 
8| 6|1122|80* ЗІН ENTE 5| 6 
8| о | па | Rist e ede 5| 9 
. 5| 5 пла | R116, вит бы 
5] 5| ив | виз 
7 d 
ІНГІ | a па [sns 6 | ine m 
e : б 
ТИЗЕ | МЫШ || Sate 
3| тиза | Rt ыы ed o | 2 
ou: flat тл 6} 6 | 112a | srosesre5*] |10| 2 
3| 8 | па | R57 а 10 | 10 
а 6 | 10 | 11.28 | 5830, БАЗІ * 
7| 7 | на | Rigs, R136 
1 
ate Nee т т | Te [Rz s. || 3] 2 Тан 
5| 5 1120 | LS10* Виз : E se ples 3] 9 
812* > 
в 4 [nz тха в вуза кат 4| 6 [irae | nata 
6| 6 | 11.20 | sis,* 5814" EXE ad sco 4| 7 $56, R37 
7| 7 [116a | Riss На 4| 7 
9| 9| 1129] $31 * AE 
9 | 9 | 11.60 | Riso 
15 
МЕНЕЕ 10] 5 цз ав» ПІРІ кте 
4| 2|uz|mi* Е 7| 7 па | Less 
4| 4 |112a| T2* TI2* 8| 3 | 11.2 | 854 
4| а | 2a|siz* 15 | 3| зиз | с1* 8| 6 | 112a | S55, SRAL * 
4| 8|1122| ки” 3| o | 12 | co* 8| 2 |1122 |557 
5| 3|112a| To 4| 8|122|c2* 8| 8 | 12а |5843 * 
5| 4 [112a 3816 8| 6 11.60 | c23 + 8 | 10 | 112a | $844 * 
5| 5 | 1160 | в14 7| 7 | па | cas* 9| 9 | па | Rist 
5| 6| 12а |5817 10 | 10 | 112a | Cá * 9| 9 | 1162 | 1887 


* Arranged to allow the elimination of variation in two directions; see section 13.44 
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TABLE 9.2a PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS, (Continued) 
Ref- А Ref- Р Ref- 
e pelte [| Dee) РМЕТ | lene] | Dae 
ence ence кре ence harder 
17 | 4| 8] 1128] C5* 24| 3| 8 | 1123 |5861 32| 4| 8 | 112a | 3874 * 
8| 8| 112 | С6 * 3| 9 | 11.23 | Ra7* 8| 5 | 11.28 | S101 
9| 9 | 1123 | С7* 3 | 10 | 1122 | R48 8 | 7 | 1122 | $102 
4 | 7 | 1122 | R46 8| в | 1.2а | 8875 * 
18| 3| 6 | 11.2 | 8845 * 4 | 10 | 11.23 | R49 * 
3| 9| 12а | R40 5|. 5|112a | R45 * 33 | 6 | 10 | 11.28 | 8104 * 
3| 9| n2» | Rat» 6| 7|112a | S85 7| 7 |112a| RS8* 
4| в | иза | 862 * 6| 8 | 11.2 | $862 * 
6| 4| 112 | S59 8| 5 | 123 | S84 35| 5| 7 | 112a |5876 
6| 5| 1122 | Seo 8| 9 | 11.2a | 5863 5 |10 | 112a | E50 * 
5| 6| 1122 | SR46* 8 | 10 | 1123 | S86 * 7| з | 11.20 | S19 
6| о [1122 |88495 9| 9 | 11.60 | R152 10| 6 |112a | 8105 * 
. 
5 9 M RAM 25| 3| 6 | 1.2 | 1816 * зе | 3| 7 | 1150 | T22 
ail s зч 5| 5 | 11.2a | 5864 * 4| 9] 11.28 | SR78 
Sil се = 5| 7 |11.2a | R51 5| 5 | 112a | L817 * 
oil a а | mens 5| 8 | 112a | R52 8| 2 |112a| T33 * 
ч 10| 4 | 11.20 | 887 * 8| в | 11.20 | 8106 * 
9| 8 | 112a | SR48 
9 | 10 | 112a | 3850 н SANKT үгү ян, 
Pe 26) 4| 8 | 11.20) R53 9| 8 |11.2a | SR77 
10 | 10 | 11.2a | 567 * e| з |122 1814* 
10 | 10 | 11.6a | R154 6| 6 | 11.2 |590,•51.5* |37| 3| 9 | 11.28 | С10* 
8| afia | 580 * 
20| 3| 9 | 11.2 | В42* 8| 8 | 1122 | 801 * 88 | 6| 9 11.28) 8107 ° 
5 | 1122 | SR 
ЕЕ Е 
4 | 10 | 1122 | 3856 * 1 A Lic чылан 
4 | 1122 | SI а 
НЕНІ, а ШЕ |) t| don mann 
6| 9 |112a | 874 * 9| 4 | 11.20 | 802 ils ates M 
8| 4 |[112а|560* 9| 9 |11.2a | SR67* a 
в | 6 |112» | 870 10 | 10 | 11.63 | R155 8 | 9} 11.28 | 8110 
“рү 10 | 7 | 11.28 | S109 
ы 28 | 3} 6 | 16 | T2t 
2 4| 7 |112a | 8868 42| 6| 7 | 11.2» | 5880 
10 | 6 | 11.2a | 871,* 8853 * 4| 8 | 1120] RS * 6 | 10 | 112a | S112 
10 | 8 | 11.2a | 8855 * 2110151328585 10| 5 11122 181011 * 
10n S ERE 7| 2 [112a | T32 10 | 10 | 112a | S113 * 
10 | 10 | 11.2a | SR57 т | в | 11.28 | 8869 
8| 6 | 11.20 | S95 * 44} 8| 10 | 11,2a | S114* 
21| 3| 5 [115a | TI 
3| 7 | 11.20 | SR59 29| 7| 7 | 1.2 | С8* 45| 3| 8 | 11.6 | T23 
3| 9 | 11.20] В44 * s| 8 | 12а | С9* 5| 3| 11.2 | 9144 
6| 2| 11.23 | ТЗ! * 5| 9 | 11.2a | 3881 
6| 6 | 11.28 | 578 * 30| з | 10 | 11.23 | 8873 5 | 10 | 11.28 | R62 
7| 6 | 11.28 | $858 4 | 10 | 11.2з | R7 * 9| 2|112a| T35 
7| 9 | 112a |5860 6| 5 | 1.2а | 8870 9| 7 | 11.20 | 8115 
8| 8 | 11.6 | R146 в| 7 |112a |397 10| 8 | 11.2 | 5116 * 
9| 3 11.2. | 577 * 6| 9 | 1120 | $98 
9| 9 | 11.2 | S80° 6 | 10 | 112a |872 * 48| 6| 8 | 11.20 | 8882 * 
9| 9 | 11.28 | 309 * 7| 7 1122 | R63 * 
22 | 4 | 10 | 1.2 | 882 * 10 | 5 | 11.23 | 996 * 
10| 5 | 11.2з | 881 * 10 | 9 | 11.2а | 5871 49 | 3| 6 | 11.6a | 1.519 
10 | 10 | 112a | 883 * 10 | 10 | 112a | 8100 * 6| 6 | 11.2 | 1918 


* Arranged to allow the elimination of variation in two directions; see section 13.48. 
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TABLE 9.28 PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS (Continued) 
3 Е N Ref- 

Rel) Design Re | Design Design 
kir| er Fan t|k|r | æ en Elk has | ew Irons) 
ence E ence ence 

7| 7] 11.20 | 8883 * 56| 8| 9 |112s |5121 72| 8| 9| 11.2 | 8890 
7| 9 [1122 | Res 9| 8| 112 | 8889 

7 | 10 | 11.28 | R65 57| 9| 3 [112a |Sr.I8* 
9| 9 112a |8122*81.20*| 80 | 9| 9| 112: | Res 
8| 4 [1122 | StI7* 
8| 8 | 11.28 | S118 * 63 | 7| 9 | 11.3 |5887 81| 9| 9 | 11.28 | SRo1 * 
10 | -6 | 1122 | S117 8| в 112a | noo* 
о 9| 4) 1128 | 81.21 ж |10] 5/112] 81.25 * 
6| 9 | 11.20 | 3884 9 | 10 | 1122 | 8123 10 | 10 | 1123 | 8124 * 
3] 9| па | T% 8| 7| 7/1120] Ls19* 
арти. s] bil fuss | еме 85| 5| 51122 | 8126 
8 | 10 112a | R67 
10 Mu irai зот 100 | 9| о | 11.28 | L20* 
7| 8 |1124 | 8886 70 | 10 | 3 | 11.28 | $1.22 
8| 7 | 11.2a |8885 10 | 6 | 112a | 81.23 * 156 | 6| 68| 11.58 


* Arranged to allow the elimination of variation in two directions; see section 13.4а. 


9.1 
9.2 
9.3 
9.4 


9.5 
9.6 


9.7 


9.8 
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CHAPTER 710 


Lattice Designs 


10.1 Balanced Lattices 


10.11 Description. The number of treatments must be an exact 
square. The size of block is the square root of this number. Incomplete 
blocks are combined in groups to form separate replications. The special 
feature of the balanced lattice, as distinguished from other lattices, is 
that every pair of treatments occurs once in the same incomplete block. 
Consequently, all pairs of treatments are compared with the same degree 
of precision. 

The numbers of replications are rather severely restricted, as well as 
the numbers of treatments. The useful plans are indexed below. Bal- 


BALANCED LATTICES CONTAINED IN PLANS 10.1-10.6 


Number of treatments 9 16 25 49 64 81 
Units per block 3 4 5 7 8 9 
Replicates 4 5 6 8 9 10 


anced lattice designs cannot be constructed for 36 treatments, and none 
has been found for 100 or 144 treatments: 


10.12 Statistical Analysis. The analysis is comparatively easy. It 
will be illustrated by an experiment on the effects of 9 feeding treatments 
on the growth rates of pigs, conducted by the North Carolina Agricul- 
tural Experiment Station. The analysis has been described in more de- 
tail by Comstock et al. (10.1), and the nutritional information obtained 
from the results by Peterson её al. (10.2). 

For pigs of a given breed, previous experience indicated that a con- 
siderable part of the variance in growth rate between animals can be 
ascribed to the litter. Hence the experiment was planned so that litter 
differences would not contribute to the intra-block error. The pigs were 
divided into sets of 3 litter-mates. Two sets of 3 were assigned to each 
block. Within a block, each treatment received one member of each set. 
Thus the experimental unit was composed of 2 pigs each feeding in the 
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same pen. The plan and growth rates are given in table 10.1; the plan 
has been rearranged so that it follows the same pattern as plan 10.1. 


TABLE 10.1 GAINS IN WEIGHT (POUNDS PER DAY) FOR A TOTAL OF 2 PIGS 


Rep. I 
Block 
2 @ (2) (3) 
2.20 1,84 2.18 


Qo 0 © (9 
2.05 0.85 1.86 


з) @ (8) (9) 
0.73 1.60 1.76 


Вер. Ш 


oOo о (9 
1.81 1.16 1.11 


я 0 0 Q0 
1.76 2.16 1.80 


o9 GQ wo 8 
1.71 1.57 1.13 


Totals 


4.08 


5.72 


4.41 


214.21 


Block 
(4) 


(5) 


(6) 


(10) 


(11) 


(12) 


Rep. II 
(1) (4) (7) 
119 1.20 1.15 
(2) (5) (8) 
2.26 1.07 1.45 
(3) (6) (9) 


2.12 2.03 1.63 


Rep. IV 
(0 (0 (8) 
1.77 1.57 1.48 
о o 9 
1.50 1.60 1.42 
з 6 (Q0 


2.04 0.93 1.78 


Treatment totals and adjustment factors 
Adjusted total ^ Mean per 


I В W-(8T-4B,.4G) T-4,W 

H 6.97 18.61 +3.89 7.21 
2 7.36 21.24 —5.46 7.02 
3 8.05 21.16 —3.07 7.86 
4 6.42 17.23 +7.76 6.91 
5 4.01 18.37 —4.03 3.76 
6 7.62 21.03 —3.84 7.38 
7 5.46 18.10 +1.40 5.55 
8 5.61 18.05 +2.05 75.74 
9 5.92 18.47 +1.30 6.00 

G = 57.42 172.26 0.00 57.43 


Totals 


3.54 


4.78 


unit 


The steps in the analysis are as follows. The algebraic formulae refer 
to à k X k lattice in blocks of k units, with r = (k + 1) replicates, 

1. Calculate the block totals, the replication totals, the grand total G, 
and the treatment totals Т, shown under the plan. 

2, For each treatment, calculate the total B, for all blocks in which 
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the treatment appears. For treatment 4, this is 
B, = 4.76 + 3.54 + 4.41 + 4.52 = 17.23 


Аз a check, the total of the B; values should be k times the total of the T 
values. 
3. Compute the quantities 


W-kT — (&Е+1)В,+ © 
whose sum should be exactly zero. 

4. The analysis of variance is now obtained. The total s.s. and the 
sums of squares for replications and treatments are found in the usual 
way. The sum of squares for blocks within replications, adjusted for 
treatment effects, is 


Sw? (8.89)? + (6.46)? +... + (1.30)? _ 


PEF 108 ме 
5. Calculate the adjustment factor 
Ey — Е, 0.1776 — 0.0773 
кт ( ь А к = 0.0628 


BE, 9х0.1776 


where Es and E, are the blocks and intra-block m.s., respectively. The 
adjusted treatment total is (7 + АТУ) as shown in table 10.1. То avoid 
confusion, the adjusted means are shown per unit (total of 2 pigs) al- 
though a mean per pig would be more natural. If E, is less than Ев, в 
is taken as zero, and no adjustments are applied to the treatment totals. 


TABLE 10.2 ANALYSIS OF VARIANCE FOR TOTAL GROWTH RATE OF 2 PIGS 


d.f. 
ma 
General k=3 8.8. та. 

Replications k 3 0.0774 
Treatments (2-1) 8 3.2261 
Blocks (adj.) (8-1) 8 1.4206 0.1776 
Intra-block error (6 = 1002-1) 16 1.2368 0.0773 

Total @+R-1) 35 5.9609 


6. For t-tests, calculate the effective error m.s. 
E, = Е.(1 + ku) = 0.0773(1 + 3 X 0.0628) = 0.0919 


The purpose of the adjustment factor is to increase E, so as to take ac- 
count of sampling errors in the block correction values ,W. The ordi- 
nary rules for the calculation of t-tests may now be applied to E’. Thus 
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the variance of the difference between two adjusted treatment totals is 
2rE,, while that for the difference between two adjusted treatment 
means is 2E,'/r, or 0.0460. The standard error of this difference is 0.214. 
If means per pig were taken, the standard error would be 0.107.* 

7. The treatments m.s. in table 10.2 cannot be tested against the 
intra-block error m.s., since the former contains some block effects. For 
an approximate F-test, calculate the sum of squares of deviations of the 
adjusted treatment totals, which comes to 12.6771. This is divided by 
г or 4 to bring to a single-unit basis, and by (К? — 1) or 8 to obtain the 
mean square, 0.3962. This is tested against the effective error m.s., 
0.0919. The F-ratio, 0.3962/0.0919, or 4.31, has 8 and 16 d.f. 

8. In order to estimate the precision relative to randomized blocks, 
pool the mean squares for blocks (adjusted) and the intra-block error. 
The result is 0.1107, with 24 d.f., and is an unbiased estimate of the error 
variance that would have been present if the experiment had been ar- 
ranged in randomized blocks. This figure is compared with the effective 
error m.s., 0.0919. The relative precision is 0.1107/0.0919, or 120%. 
This means that 4 replications of the balanced lattice appear to have 
been slightly less accurate than 5 replicates of randomized blocks. 

These designs may be used in factorial experiments where it is desired 
to avoid any sacrifice of replication on the interactions relative to that 
on the main effects. Some of the combinations of levels that may be 
tested in balanced lattices are: 3 X 3,8 X 2,4 X 4, 4 X 2 X 2, 24,5 X 
5,7 X 7,8 X 8, 43, 29, 9 X 9, 34. In analyzing a factorial experiment, 
first carry through the procedure described in this section. The sum of 
squares for deviations of the adjusted treatment totals is then divided 
into the sums of squares for main effects and interactions in the usual 
way; a divisor r must be introduced in order to convert these sums of 
squares to a single-unit basis. The resulting mean squares are tested 
against the effective error m.s., Е.(1 + ku), which has (k — 1) (k? — 1), 
or in this case, 16 d.f. 


10.12а Effects of Errors in the Weights. Тһе fact that the recovery 
of inter-block information is based upon estimated weights has two prin- 
cipal consequences. There is a loss of precision in the sense that the 
estimates are less precise than they would be if the weights were known 
exactly. The magnitude of this loss has been investigated by Yates 
(10.12), Cochran (10.5), and Kempthorne (10.1a). Their results sug- 
gest the conclusion that the loss is unimportant if there are at least 
10 d.f. for estimating the mean square for blocks. : й 

A greater source of disturbance, more difficult to investigate, is that 
the formula for the standard error of the difference between two ad- 


* If wis taken ав zero, use E, as the effective error m.s., rather than the pooled m.s. 
for Е, and Ee. This-practice is recommended for future designs also. 


400 LATTICE DESIGNS 10.124 


justed treatment means in an underestimate. The bias arises from two 
sources—from the assumption that true weights were used and from the 
substitution of estimated for true weights when the formula is computed 
in practice. For the 5 X 5 simple lattices, Meier (10.2a) has computed 
the bias in the average variance of the difference between two treatment 
means. His investigation also throws light on the question whether we 
ought to use Е, as the effective error mean square when и is taken as 
zero, as recommended in the footnote to the previous page, or the pooled 
mean square for E; and E, in this situation. Results for both estimates, 
V, and Vp, say, are shown below for a range of values of w/w’. 


BIAS OF VARIANCE ESTIMATES EXPRESSED AS PERCENT ERRORS 


w/w' 1 2 3 4 5 6 7 8 9 
Ve +114 -50 -74 -73 -68 -61 -55 -50 -46 
Vp —62 -94 -90 -80 -71 -63 -56 -50 —46 


Use of E, when и is zero produces a smaller negative bias over most of 
the range of w/w’ but results in a positive bias when w/w’ is near 1, 
i.e., when there is little or no block variance. Although overall there is 
little to choose between the two estimates, V, appears less subject to 
bias on the average. 

With V,, the results suggest an average underestimation of about 6% 
in the variance, or 3% in the standard error. For 3 X 3 and 4 X 4 lat- 
tices, we may speculate that the standard error may be too low by as 
muchas 10%. For6 X б and larger lattices, the underestimation should 
be negligible. It would be desirable to develop a correction term in the 
standard error formula which removes most of the bias, but further work 
on this remains to be done. 


10.13 Missing Data. Since lattice designs are often used in large ex- 
periments, involving perhaps several hundred observations, it is not easy 
to ensure that all the observations are accurately made. Even with care- 
ful management of the experiment, there is always a chance that mistakes 
or accidents will affect a few of the observations. Consequently, missing 
data tend to be more common with lattices than with small experiments. 

Methods for the analysis of the results of lattice experiments with in- 
complete data have been developed by Cornish (10.3), As might be ex- 
pected, the computations are lengthy. They become simpler in two 
special cases. The first arises when the incomplete blocks are ineffective, 
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so that the analysis reduces to a “randomized blocks” analysis. In this 
event values are substituted for the missing observations by the formula 
which applies to randomized blocks. The second case occurs at the other 
extreme when the variation among blocks is so large that the inter-block 
information is negligible. Here the correct procedure is to insert values 
for the missing observations by minimizing the intra-block error 
m.s. 

Since the general procedure must reduce to these two special pro- 
cedures in the appropriate cases, it might be anticipated, as Cornish’s 
solution shows, that both the “randomized blocks" and the “іпіга- 
block” estimates for the missing observations are required. Thus we 
have to find two estimates for each missing value. Similarly, two anal- 
yses of variance are necessary, one to obtain the correct value for the 
intra-block error m.s. and one for obtaining the mean square among 
blocks. 

In an attempt to reduce the amount of arithmetic, some investigation 
has been made of the consequences of using a single estimate and a single 
analysis of variance. For this purpose the ''intra-block" estimate seems 
the better of the two. It gives the correct intra-block error m.s.; the 
block m.s. is in general slightly too high. It provides an excellent ap- 
proximation when the blocks are effective and is at its worst when blocks 
are ineffective; that is, when a “randomized blocks” analysis should have 
been used. In the latter case it still gives unbiased estimates of the 
treatment means, but they are not as accurate as the estimates obtained 
by the use of the “randomized blocks” formula. The chief defect of the 
"randomized blocks" formula is that when block variation is large it 
tends to give an overestimate of the intra-block error m.s., and some- 
times the bias is substantial. For this reason the “randomized blocks" 
estimate is considered more hazardous for general use, despite its greater 
simplicity. 

The procedure that is given for the incomplete block designs in this 
and succeeding chapters is to insert values for the missing observations 
by means of the “intra-block” formula. Thereafter the analysis is con- 
ducted in the usual way for complete data, except that in the analysis 
of variance 1 d.f. is subtracted from the intra-block error for each miss- 
ing observation. This method, or the more accurate Cornish method, 
should be used whenever it is intended to recover inter-block information. 
If many observations are missing, or if it is evident on inspection that 
blocks are relatively ineffective, the experimenter may decide at the start 
to analyze the data as a randomized block experiment. In this case sub- 
stitutes for the missing values should of course be obtained by the 
formula for randomized blocks (section 4.25). 


402 LATTICE DESIGNS 10.13 


The “intra-block” formula for the estimate of a missing observation 
ina k X k balanced lattice with (k + 1) replications is as follows. 
ЮТ +kk+DB-R+G—kT, — ЕВ, 
"T к — 1)? 
In this formula, Т, B, and R denote as usual the totals for the treatment, 
block, and replication which contain the missing observation, while G is 
the grand total. Also 
Т, = total (over all replications) of all treatments that appear in 
the block which has the missing observation. 
B, — total of all blocks in which the treatment with the missing 
observation appears. 


(10.1) 


Example. Suppose that the observation 2.20 for treatment 1 in rep- 
licate 1 of table 10.1 had been missing. Form the block, replicate, treat- 
ment, and grand totals, just as in the ordinary analysis. It is helpful to 
insert an z for the missing observation and to include this z in all totals 
where it should appear. When the value of z has been found, it can then 
be inserted in all the appropriate places and the data are ready for com- 
puting the analysis of variance. The list of treatment totals appears as 
shown below. 


T 
н 


осо оюк 
е Ee оқ-ары 
бевававнс зз 


© 
| 
8 
8 
+ 
a 


The quantities needed are 
T=477, В-40, R=1287, G = 55.22 
Ть = 4.77 + 7.36 + 8.05 = 20.18 
B, = 4.02 + 3.54 + 4.08 + 4.77 = 16.41 
Hence 
, 090677) + (12)(4.02) — 12.87 + 55.22 — (8)(20.18) — (3)(16.41) 
12 


= 1.98 
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The analysis of variance is now computed in the usual way, except 
that the total degrees of freedom are reduced from 35 to 34 and the 
intra-block error degrees of freedom from 16 to 15. The same adjust- 
ment factor » may be used (there is actually a slight change which can 
be ignored). 

When several observations are missing, the values are inserted by the 
method of successive approximation (illustrated in section 4.25). The 
exact formulae for t-tests are complicated. The following approximate, 
rule assigns an effective number of replicates to each of two treatments 
A and B whose means are being compared. In any replicate of the ex- 
periment, A is credited with 0 replication if it is absent; with 0 replica- 
tion if A is present but В occurs in the same block and is absent; and with 
1 replication otherwise. The same rule is applied to B. For instance, in 
а t-test of the difference between treatments (1) and (3) in table 10.1, 
treatment (1) is credited with 3 replications and treatment (3) with 3 
replications (it loses 1 replication because in the first replicate of the ex- 
periment treatment (1) is in the same block and is missing). On the 
other hand, in comparing treatment (1) with treatment (4), the effective 
numbers of replications are 3 and 4, respectively. 


10.2 Partially Balanced Lattices 


10.21 Simple Lattices (Two Replicates). For an experiment with 2 
replicates, use the first 2 replicates of the appropriate set shown below 
(plans 10.1-10.6). Notice that designs are available for 36, 100, and 
144 treatments (plans 10.7-10.9) as well as for those numbers of treat- 
ments for which balanced designs are given. Plans for 121 and 169 
treatments can be taken from plans 12.7 and 12.8. 

The asymmetry of the designs is apparent from the plans; thus, with 
9 treatments (plan 10.1) the first treatment appears in the same block 
as treatments (2), (3), (4), and (7), but not in the same block as any of 
the remaining treatments. 

For 9 and 16 treatments, simple lattices are unlikely to be more ac- 
curate than randomized blocks unless the variation among incomplete 
blocks is great compared with that within incomplete blocks. Further, 
the numbers of degrees of freedom for estimating the error are only 4 and 
9, as against 9 and 16, respectively, for randomized blocks. 


10.22 Triple Lattices (Three Replicates). The first 3 replicates of 
plans 10.1-10.9, 12.7, and 12.8 are used. Designs may be obtained for 
all squares from 9 to 169. With 9 treatments the precaution mentioned 
in section 10.21 applies also to the triple lattice. 
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10.23 Four Replicates. These designs may be obtained either (i) by 
duplicating the simple lattice or (ii) by using a quadruple lattice, i.e., 
the first 4 replicates of the plans. With 36 or 100 treatments, only the 
first method can be used, since no quadruple lattice exists. The second 
procedure is slightly preferable, because the resulting design comes 
closer to symmetry, but the statistical analysis requires more time. 


10.24 Five Replicates. The first 5 replicates from the balanced set 
are taken. Designs are not included for 9, 36, 100, or 144 treatments. 


10.25 Higher Numbers of Replicates. Balanced designs should be 
used for the following numbers of treatments and replicates: 25, 6; 49, 
8; 64, 9; and 81, 10. For 9 treatments іп 8 or 12 replicates, 16 treatments 
in 10 replicates, and 25 treatments in 12 replicates, the plan for the bal- 
anced design should be repeated. 

In other cases the following recommendations are made. 


біш replicates. Use the triple lattice twice. 

Eight replicates. Use the simple lattice four times or the quadruple 
lattice twice. 

Nine replicates. Use the triple lattice three times. 

Ten replicates. Use the simple lattice five times or the quintuple lattice 
twice. 


The designs recommended are not always as fully balanced as they 
might be made. For instance with 6 replicates of 16 treatments the bal- 
anced design (5 replicates) plus 1 extra replicate gives a more nearly 
symmetrical arrangement than the triple lattice used twice. Since 
separate computing instructions would be required for such plans, we 
have preferred to adhere to a more uniform system. For this reason no 
account is given of designs with 7 replicates, though their statistical 
analysis presents no great difficulty to the reader who has mastered the 
principles. 


10.26 Arrangement of Experimental Material. In the arrangement of 
а group of experimental material so as to apply one of those designs, the 
most important criterion is that units within the same incomplete block 
be homogeneous. In field trials, for example, if the plots are oblong the 
usual procedure is to һауе the incomplete blocks as nearly square as 
practicable, the plots extending the whole length of one side of the block. 
Uniformity trial investigations have shown that this layout gives on the 
average the most homogeneous block. In cases where there is detailed 
knowledge of the experimental site, some other method of grouping may 
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be superior. Efforts should be made to keep the experimental technique 
uniform for all units in the same block. Changes in technique that are 
necessary should be made when changing from one block to another. 

When the blocks have been formed, there is a secondary advantage in 
forming replications so that blocks within the same replicate are as sim- 
Паг to one another as the material permits, since this increases the pre- 
cision of inter-block comparisons. If the full statistical analysis is carried 
out, it is more important to have the incomplete blocks homogeneous 
than to have the replications homogeneous. Thus, when the grouping 
of blocks into compact replications is troublesome, this criterion may 
be ignored without much loss of precision. 

On the other hand, it is desirable to have homogeneous replications if 
the data may subsequently be analyzed as a randomized block design. 
Аз we have pointed out, the randomized block analysis may be needed 
in fields of work where certain experimental units or whole treatments 
are likely to be destroyed during the course of the experiment (e.g., in 
pasture plots subject to winterkill). 


10.27 Randomization. The randomization consists of three steps. 

1. Randomize the blocks separately and independently within each 
replication. 

2. Randomize the treatments separately and independently within 
each block. 

3. Allot the treatments to the treatment numbers at random. 

For methods of randomization, see chapter 15. 

Steps 1 and 2 give each treatment an equal chance of being allotted to 
any experimental unit. These steps correspond to the allotment of 
treatments to units at random in an ordinary randomized block design. 

The function of step 3 is to decide by random choice which groups of 
treatments will form the blocks of the design. If differences among blocks 
are large, the error variance per plot for the mean of а group of treat- 
ments which lie in the same block may be considerably higher than the 
average error variance. This additional randomization ensures that the 
average error variance may be used, in nearly all cases, for comparisons 
among groups of treatments. For further discussion, see reference (10.4). 

When a plan is repeated in order to obtain extra replications, а separate 
randomization must be made for every replicate. 


10.28 Statistical Analysis. An account of the theory, with worked 
examples of the simple and triple lattice, is given in reference (10.5); a 
systematic presentation of the shortest computational methods, with 
worked examples of the simple, quadruple, and balanced lattices in 


406 


LATTICE DESIGNS 


10.28 


(10.6); and a detailed description of the methods for carrying out the 
computations on I.B.M. (Hollerith) punched-card machines in (10.7). 
The computational methods differ slightly from one reference to another, 
but all are basically the same. 

Table 10.3 shows the plan and yields for a 5 X 5 simple lattice ex- 
periment on soybeans, the treatments being 25 varieties produced in the 


TABLE 10.3 Үгез оғ А 5 X 5 SIMPLE LATTICE EXPERIMENT ON SOYBEANS 


() 6 
(6) 16 
(11) 17 
(16) 18 
(21) 14 


(1) 24 
(2) 21 
(8) 16 
(4) 17 
(5) 15 


(1) 30 
(6) 29 
(11) 41 
(16) 29 
(21) 22 


uC —1.4 


(1) 38.1 
(6) 26.3 
(11) 47.1 
(16) 25.3 
(21) 23.3 


(Yields are in bushels per acre, minus 30 bu.) 


@ 7 
(7) 12 
(12) 7 
(17) 16 
(22) 15 


(6) 13 
пи 
(8) 4 
(9) 10 
(10) 15 


Rep. I 
(3) 5 
(8) 12 
a3) 7 
(18) 13 
(23) 11 


Rep. II 


(11) 24 
(12) 14 
(13) 12 
(14) 30 
(15) 22 


(4) 8 
(9) 13 
(14) 9 
(19) 13 
(24) 14 


(16) 11 
(17) 11 
(18) 12 
(19) 9 
(20) 16 


"Treatment totals (unadj.) 


B 
(5) 6, 32 
(10) 8| 61 
(15) 14 | 54 
(20) 14 | 74 
(25) 14 | 68 
289 
(21) 8| 80 
(22) 23 | 80 
(23) 12 | 56 
(24) 23 | 89 
(25) 19 | 87 
392 
uC 
(5) 21 | +9.5 
(10) 23 | —1.3 
(15) 36 | +7.5 
(20) 30 | —2.3 
(25) 33 | +27 
—4.8 0.0 


(4) 29.5 (5) 25.7 
(9) 167 (10) 16.9 
(14) 41.5 (15) 387 
(19) 147 (20) 22.9 


(2) 28 (3) 21 (4) 25 
(7) 23 (8) 16 (9) 23 
(12) 21 (13) 19 (14) 39 
(17) 27 (825 (19) 22 
(22) 38 (23) 23 (24) 37 
—3.6 -13 —5.0 
Treatment totals (adj.) 
(2) 33.9 (3) 29.2 
(7) 18.1 (8) 13.4 
(12) 249 (13) 25.2 
(17) 211 (18) 214 
(22) 37.1 (23) 244 


(24) 847 (25) 30.9 


soybean breeding program of the North Carolina Experiment Station. 
The experiment actually contained 4 replications, obtained by duplicat- 
ing the plan for the simple lattice. The first 2 replications will serve to 
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illustrate the analysis for simple, triple, and quadruple lattices, which all 
follow the same general pattern. In section 10.29 the data from the 
whole experiment will be analyzed in order to indicate the procedure 
when the basic plan is repeated. 

1. Find the block totals B, the replication totals, the treatment totals, 
and the grand total. 

2. For each block calculate 


C = total (over all replicates) of all treatments in the block — 7B 
For example, for block 1 in replication 1, 
C = 30 + 28 + 21 + 25 + 21 — (2)(82) = +61 


Find the replicate totals №, of the C values. These replicate totals 
should add to zero. 
3. The analysis of variance is as follows. 


d.f. 8.8. m.s. 
Replications (r—-1) 1 212.18 
Treatments (unadj.) (k — 1) 24 559.28 
Blocks within replications (adj.) r(k — 1) 8 501.84 62.73 Еь 
Intra-block error (k — 1k — k — 1) 16 218.48 13.662, 
Total (ri? — 1) 49 1491.78 


All sums of squares are found in the usual way except that for blocks 
(adjusted), which is given by 

Xo ERr? P+ OP +--+ BY? _ (103)? + (108)? 
re) rr —1) 10 50 

= 501.84 

where R, denotes a replication total of the C’s. | 

4. The weighting factor used to obtain the adjusted treatment totals is 

(Es — Ee) _ (62.73 — 13.66) 2d 

7—1 (5)(62.73) 
where E and E, are, respectively, the mean squares for blocks and intra- 
block error. If Ey is less than Е», the factor is taken as zero and no ad- 


justments are made for block effects, the experiment being analyzed as if 
in randomized blocks. 

Each С value is now multiplied by » to obtain the block corrections 
uC. These should sum to zero, apart from rounding errors. Hach treat- 
ment total is adjusted by applying the appropriate correction for every 


1564 


Г? 
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block in which the treatment appears. For example, the adjusted total 
for treatment 4 is 
25 + 9.5 — 5.0 = 29.5 


In the case of the simple lattice it pays to write the uC values around the 
table of unadjusted treatment totals, putting the values for replication 
1 in the right-hand column and those for replication 2 in the bottom row. 
Then the adjustments to any treatment total are at the end of the row 
and the bottom of the column in which the treatment lies. 

5. The error variance of the difference between two treatment means 
is slightly smaller for treatments that appear in the same block than for 
those that do not. The formulae are: 


Two treatments in the same block: 


= [+ (r — 14 = кысыл + 0.1564] = 15.80 
Two treatments not їп same block: 
= П + ra] = 20350 [1 + (2) 0.1564)] = 17.93 
.2Е, rku | _ (2)(13.66) (2)(5) (0.1564) А 
Average: = Е 25%! Е [+ г ] 17.22 


The corresponding standard errors are 3.97, 4.23, and 4.15, respec- 
tively. Except with small designs it is sufficient to use the average value, 
4.15, for all t-tests between pairs of treatments. 

6. The analysis of variance does not supply an F-test of the adjusted 
treatment totals. A test of the unadjusted treatment totals can be ob- 
tained by analyzing the data as if the experiment were in randomized 
blocks. The error m.s. is the pooled mean square for blocks and intra- 
block error, as shown below. 

d.f. 8.8. m.s. 
Treatments (unadj.) (2-1) 24 559.28 23.30 
Ептог (5% — Yer — 1) 24 720.32 30.01 


This test is not fully sensitive, since it is based on unadjusted treatment 
totals, but it will be sufficient in cases where an F-test is of secondary in- 
terest or where differences among incomplete blocks are small. 

An F-test of the adjusted treatment totals requires a little more cal- 
culation. This is most easily done by changing the unadjusted treat- 
ments s.s. (which is already available) so that it may be tested against 
the intra-block error m.s. Calculate B,, the unadjusted sum of squares 
for blocks within replications, which comes to 350.00. If Ва is the ad- 
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justed sum of squares for blocks within replications (501.84), we sub- 
tract the following quantity from the treatments s.s. 


"d 


— (5)(1)(0.1564) | | (350.00) — (501.84)| = —85.30 


1 + (5) (0.1564) 
In this case the subtraction term is negative, which is rather unusual. 
We add 85.30 to the unadjusted treatments s.s. The F-test is then com- 
pleted by the following analysis. 


d.f. 8.8. m.s. 


Treatments 24 64458 2686 
Intra-block error 16 218.48 13.66 


The F-ratio, 1.97, lies between the 10 and the 5% levels. The F value 
differs considerably from that obtained by the randomized blocks test, 
as may happen when the adjustment for incomplete block effects results 
in a substantial increase in precision. 

7. The error m.s. in the randomized blocks analysis was found to be 
30.01. To estimate the gain in accuracy over randomized blocks, we 
compare this figure with the effective error variance 


Е, Е ТЕ а-ы - (13.66) [! + 990159)! = 17.22 


The relative accuracy is 30.01/17.22, or 174%. 


10.29 Statistical Analysis for Repetitions of the Designs. Suppose 
that the basic design contains n replicates (where, e.g., n = 2fora simple 
lattice) and that this is repeated p times, so that the total number of 
replications is г = np. For illustration, we present a joint analysis of 
the data in tables 10.3 and 10.4 (pp. 406 and 412), which show 4 repli- 
cations of the 5 X 5 simple lattice. Replication III is à repetition of 
replication I, and IV of II. The field results have been rearranged so 
that treatments follow the same order within corresponding replications. 
In terms of our notation, п = 2, р = 2, andr = 4. 

1. Calculate the block totals, the replication totals, the treatment 
totals (shown in table 10.4), and the grand total. 

2. Arrange the block totals in supplementary tables as on the right of 
table 10.4. In these tables, blocks that contain the same set of k treat- 
ments are placed in the same row. Each table has К rows and р columns, 
while the number of tables ism. Form the row and column totals of each 
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table. For each group of similar blocks, compute the quantities 


C = total (over all replicates) of the treatments appearing in the 
group — n(total of the group of blocks) 


Thus for the first block in replications II and IV, which contains treat- 
ments (1), (6), (11), (16), and (21), 
C = 59 + 51 + 80 + 63 + 58 — 2(157) = —3 


Find the replication totals Re of the C values; these should add to zero. 
3. The analysis of variance is as follows. 


d.f. 8.8. m.s. 
Replications (т — 1) 3 226.19 
Treatments (unadj.) (k? — 1) 24 791.24 
Blocks within replications (adj.) rík — 1) 16 786.00 49128 
Component (а) n(p — 1)(k — 1) 8 164.72 
Component (b) n(k — 1) 8 621.28 
Intra-block error (k — 10% — k — 1) 56 761.56 13.608, 
Total (rk? — 1) 99 2564.99 


The blocks s.s. contains two components, both of which are obtained 
from the supplementary table of block totals. 

Component (a) is a new component, which arises only when the design 
has been repeated. It is composed of the differences between the totals 
of blocks that contain the same set of treatments. Since each supple- 
mentary table has k rows and p columns, it may be analyzed as follows. 


9.4. 
Rows (k —1) 
Columns (p — 1) 
Rows X columns (К — 1)(p — 1) 


Total (kp — 1) 


The sum of squares for component (a) is the sum of the rows X columns 
interactions over all » tables. It may be obtained from the following 
calculations. 


Total: (82)? + (61)? +- - -+ (75)? + (84)? — (650)? + (749)! 


TIE = 80 = 602.18 
. (104)? + (142)? +-.-+ (171)? (650)! + (749)? 
Rows: pk = 10 - 56 = 309.28 
. (289)? + (361)? + (392)? + (357)? _ (650) + (749)? 
Columns: a aa mre ae 128.18 


Component (a) s.s. = 602.18 — 309.28 — 128.18 = 164.72 
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The above is a general method that covers all cases. With a simple 
lattice and p = 2, it is quicker to calculate the differences between the 
totals of similar blocks. The sum of squares of deviations of these dif- 
ferences from their replication means, divided by 2k, gives the sum of 
squares for component (а). 

Component (b) is the component that is present even when there are 


no repetitions. 
2467 LR? 
kín —1) k?r(n — 1) 


(63)? + (29)? +- -++ (48)? " (99)? + (99)? 
я 20 100 


Component (b) s.s. = 


= 621.28 
The sum of squares for blocks within replications (adjusted) is the pooled 
sum of squares for the two components. All other terms in the analysis 
of variance are computed in the usual way. 

4. If Ey is the pooled mean square for blocks, and Е, that for error, 
the weighting factor is 

E pE — Ee) мо} ros (Q)(BB.52) 

“MG — PBs + @— DE] (52) 49.12) + 13.60) 
where # will be recalled that р is the number of repetitions of the basic 
design, in this case 2. Аз usual, no adjustments are made if Еь is less 
than Ee. 

The block corrections „C are obtained and entered in table 10.4; they 
should add to zero except for rounding errors. Each treatment total is 
adjusted by applying a correction for each group of blocks in which the 
treatment appears. For instance, treatment (13) appears in the third 
block in replications I and III and in the third block in replications II 
and IV. Its adjusted total is 


44 + 8.8 — 0.1 = 52.7 


5. The error variances for the difference between two treatment means 
are: 


= 0.1270 


Two treatments in the same block: 
.60; 
2Е, | (a — тушр = EESO p 4 01270] = 770 


4 
T 
Two treatments not in same block: 
.60) 
2, uj" OO ) t + @(0.1270)] = 8.56 


T 


.2E, nku (2)(13.60) (2)(5)(0.1270) =8.28 
Average: e - а рК е. ] 
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TABLE 104 A 5X 5 SIMPLE LATTICE (reps. III anp IV) 


Block totals 
Rep. Ш I ІП Sum С uC 


(1) 13 (226 (3 9 (4 13 (5) 11 | 72 32 72 104 +63 4-80 
(6) 15 (7) 18 (8) 22 (9) 11 (10) 15| 81 61 81 142 —29 — 3.7 
(11) 19 (12) 10 (13) 10 (14) 10 (15) 16 | 65 54 65 119 +69 + 8.8 
(16) 21 (17) 16 (18) 17 (19) 4 (20) 17| 75 74 75 149 —34 - 43 
(21) 15 (22) 12 (23) 13 (24) 20 (25) 8 | 68 68 68 136 +30 + 38 


361 289 361 650 +99 +12.6 


Rep. IV П IV Sum С uC 


(1) 16 (6) 7 (11) 20 (16) 13 (21) 21 | 77 80 77 157 -3 
(2) 15 (7 10 (12 11 (17) 7 (22) 14 | 57 80 57 137 +2 +03 
(3 7 (8 11 (13) 15 (18 15 (23) 16 | 64 56 64 120 — 1 
(4 19 (9) 14 (14) 20 (19) 6 (24)16| 75 89 75 164 —49 — 62 
(5) 17 (10) 18 (15) 20 (20) 15 (25) 14| 84 87 84 171 —48 – 61 


357 392 357 749 —99 —125 


"Treatment totals (4 reps.) 
(1) 59 (2 69 (3) 37 (4) 57 (5) 49 +80 
(6) 51 (7) 51 (8) 49 (9) 48 (10) 56 —3.7 
(11) 80 (12) 42 (13) 44 (14) 69 (15) 72 +88 
(16) 63 (17) 50 (18) 57 (19) 32 (20 62 | —43 
(21) 58 (22) 64 (23) 52 (24) 73 (25) 55 +38 
—0.4 +03 —0.1 —6.2 —6.1 


Adjusted treatment totals 
(1) 66.6 (2) 773 (3 44.9 (4) 58.8 (5) 50.9 
(6) 46.9 (7) 47.6 (8) 452 (9) 381 (0) 46.2 
(11) 88.4 (12) 511 (13) 527 (14) 71.6 (15) 747 
(16) 58.3 (17) 46.0 (18) 52.6 (19) 21.5 (20) 51.6 
(21) 61.4 (22) 68.1 (23) 557 (24) 70.6 (25) 52.7 


6. An approximate F-test of the adjusted treatment totals is obtained 
by the method described in the previous section, step 6. In this case the 
quantity that must be subtracted from the treatments s.s. is 


n 
k(n — Пи Е - A 
в-а +) " 
where B, is the unadjusted апа B, the adjusted sum of squares for com- 
ponent (b) of the blocks. Both quantities have already been computed: 
Ba, 621.28, appears in the analysis of variance, while B, 309.28. is the 
Tows s.s. used in computing component (а). 
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7. A comparison with randomized blocks is made as in the previous 
section, step 7. The effective error variance of the lattice is taken as 
nku | 
(k+1) 


[+ 


10.210 Missing Data. If the missing observations are numerous, an 
analysis by the method for randomized blocks is recommended. If it is 
desired to carry out the full analysis, missing values are estimated by the 
formula which minimizes the intra-block error s.s., though as explained 
in section 10.13 this method is only approximate. 
Experiments with no repetition of the basic design. 
(п — Т — rR + G — rkC + kC’ 
а CO жауса DOREM ER 
(т — 1)(k — D (rk — k 1) 
In this formula Т and Ё are the totals for the treatment and replicate 
that contain the missing value, G is the grand total, and C is the C value 


(defined in section 10.28) for the block which contains the missing obser- 
vation. The only unfamiliar quantity is C': 


С" = total of the C values for all blocks which contain the treat- 
ment that has the missing value 


Example. For the experiment analyzed in section 10.28, suppose that 
3 observations are missing: treatments (1) and (12) in replication I and 


(10.2) 


TABLE 10.5 Summary TOTALS FOR А 5 X 5 SIMPLE LATTICE WITH THREE 
MISSING VALUES 


"Treatment totals 
(1) 24 z (2) 28 (3) 21 (4) 25 (5) 21 
(6) 29 (7) 23 (8) 16 (9) 23 (10) 23 
(11) 41 (12) 14 {у (13) 19 (14) 9+2 (15) 36 
(16) 29 (17) 27 (18) 25 (19) 22 (20) 30 
(21) 22 (22) 38 (23) 23 (04) 97 (25) 33 
Rep. I Rep. П 
B С. В. с 
26 += 67 —х 80 —15 +2 
61 _8 80 —30 + y. 
47 +у 25—y +z 56 -8 
74 -15 59 +2 -2-5 
68 17 87 E 


В = 216 = фу 868—z2—9 +z = 362 +2 —86 +z +0 2 
<= В. + Ва = 638 +2 ут? 
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treatment (14) in replication II. Denote the estimates of the 3 missing 
values by т, у, and z, respectively; these will be obtained by the method | 
of successive approximation. | 

It is best to start by finding all block, replicate, and treatment totals 
and all С values, as in steps 1 and 2 of the computing instructions for 
this design. Wherever a missing observation is involved, include the ap- 
propriate х, у, or z in its place. The summary totals are shown in table 
10.5. Note that the total of the С values over the whole experiment is 
identically zero, so that a good check is available on this part of the cal- 
culations. 

We must now find first approximations for у and z. In the replications 
in which they are present, their values are 14 and 9, respectively. How- 
ever, replication II appears to have higher values than replication I, and 
it is probably worth while to make an adjustment for replication effect. 
If we ignore the missing values, the means per plot are about 12 for 
replication I and 15 for replication II. Since y is missing in replication 
I, we subtract З from its value in replication II, giving 11. For z we add 
3, giving 12 as the first approximation. Using y = 11, z = 12, we now 
solve for z from formula (10.2). For r = 2, k = 5 the formula becomes 

р 257 — 2R + G ~ 100 + 56! 
16 

Put у = 11,2 = 18. From table 10.5 we find for x 
Т = 24; Е = 276 + 11 = 287; G = 638 + 11 + 12 = 661 

С = 67; С’= 67—15 = 52 
re (25)(24) — (2)(287) + 661 — (10)(67) + (5)(52)_ 277 _ 
16 ‚16. 

Put x = 17,2 = 12. For y we have 


Т-14; В = 276 +17 + 293; G = 638 + 17 + 12 = 667 
С = 25 + 12 = 37; С' =25-+-12—30=7 
T (25)(14) — (2)(293) + 667 — (10)(37) + (5)(7) _ 96 


=6 
16 16 
Putz = 17, у = 6. Forz we have 
Т=9; Е = 362; б = 638 +17 +6 = 661; С = —2 
C = —2 +25 — 6 = 17 
z = 29/9) - (2)(862) + 661 — (10)(—2) + (5)(17) 267 da. 


16 16 
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This completes the first round. A second round leads to z = 18, у = 
5, 2 = 17, and there is obviously no need for further calculation. The 
rest of the analysis proceeds as usual except that 3 d.f. are omitted from 
the total s.s. and the intra-block error s.s. For approximate t-tests, see 
section 10.13. 

Experiments with repetitions of the basic design. If the basic design has 
п replicatés and these are repeated p times to give т = np replications, 
the formula becomes 
" (п — DET + (n — DrR + G — nkC + kC' — n?R' 

(n — 1)(k — D)(rk — k — 1) 
All symbols have the same meaning as in (10.2) except that C and С” are 
now derived from the totals of groups of similar blocks, just as in the 


statistical analysis for this case in section 10.29. Тһе new quantity R’ 
is defined as 


В! = total of all replications that are similar to the replication con- 
taining the missing value 


2 (10.3) 


Note that this total includes the replication with the missing value. 

Example. Inthe 5 X 5 simple lattice with 4 replications, we haven = 
2,r = 4. If treatment (1) is missing in the first replication, the reader 
may verify that 


т = 53; R= 283; @ = 1303; С = 69; С = 69 — 9 = 60 
R! = 283 + 361 = 644 


This gives 
m (25)(53) + (4)(283) + 1398 — (10)(69) + (5)(60) — (4)(644) 
Б 56 
844 
----16 
56 


10.3 Rectangular Lattices 


10.31 Description. These designs were developed recently by Harsh- 
barger (10.8) for k(k + 1) treatments in blocks of k units. They form а 
useful addition to the square lattices described in previous sections, since 
the allowable numbers of treatments, 12, 20, 30, 42, 56, 72, ete., fall about 
midway between the allowable numbers for square lattices. The sta- 
tistical analysis is quite similar to that for simple and triple lattices, 
though it takes more time because the block adjustments are not so 
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simple as with square lattices. The new designs are less symmetrical 
than the square lattices, in the sense that there is a greater variation in 
the accuracy with which two treatment means are compared. It will 
be recalled that only two standard errors are required for t-tests with the 
simple and triple lattice—one for two treatments that appear in the same 
block and one for two treatments that do not. The simple rectangular 
lattice requires four standard errors, while the triple rectangular lattice 
requires seven. For practical purposes it appears that these can nearly 
always be reduced to two. 

There are several ways of constructing the designs. One (suggested 
by С. S. Watson) is by means of a latin square with (k + 1) rows and 
columns, in which every letter in the leading diagonal is different, When 
writing down the square we attach a number to all letters except those 
in the leading diagonal, as illustrated below for а 4 X 4 square. 


А Ві C2 рз 
D4 С B5 Аб 
Вт A8 р (9 
C10 Dil A12 В 


In the first replication we place in a block all numbers that lie in the 
same row of the latin square, in the second replication all numbers that 
lie in the same column, and in the third all numbers that have the same 
latin letter. 


Block Rep. I Rep. II Вер. Ш 
1 + 2.8 i, 1 -8 it № М 1 $3 
2. 4.5.26 2 2. 5.12 3-2 9 10 
E Жу, Ды | 3! 3 69 зз ап 
4 10 11 12 4.4 7 10 4 6 8 12 


The important feature of this arrangement is that no two treatments 
are in the same block more than once. The use of a latin square with 
different letters down the leading diagonal ensures that in the third 
replication the three numbers associated with any letter are all in dif- 
ferent rows and columns, and hence have not previously occurred to- 
gether in а block. 

Plans 10.10-10.16 were constructed by this method. By using the 
first 2 replications from any plan we obtain a rectangular lattice in 2 
replications, which by means of repetitions сап be used for an experi- 
ment in 4, 6, 8, etc., replications, By using all 3 replications of the plan 
we have designs for 3, 6, 9, еїс., replications. 
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Alternatively (as pointed out by В. 8. Shrikhande), we may construct 
the designs from a balanced lattice with («+ 1)? treatments. If the 
first replication is omitted, and if all treatments that appear in any one 
selected block in the first replicate are omitted in subsequent replica- 
tions, it is easy to verify that we generate a design for k(k + 1) treat- 
ments in which every block is of size k and in which no two treatments 
appear together more than once in the same block. This method gives 
any number of replicates up to К, though of course the method fails when 
no balanced lattice exists, as with 36 treatments. The discussion in this 
section is limited to simple and triple rectangular lattices. The proper- 
ties and method of analysis for the designs with k replicates, called “near 
balance" rectangular lattices by Harshbarger, are described in (10.32). 

The method of randomizing is the same as for lattices (section 10.27). 
10.32 Statistical Analysis. In each plan, the three basic replications 
are described as the X, Y, and Z replications. Further, if we take any 
block in one replication and examine another replication, we find that 
there is one and only one block in the other replication that has no treat- 
ments in common with the chosen block. These two blocks will be called 
partners. Since it is important to be able to distinguish partners in the 
statistical analysis, each block is denoted in the plan by two symbols, 
e.g., Y2, one to mark the replication and one to indicate the partners. 
All partners carry the same number: thus the partners of У2 are X2 and 
Z2. Tt is advisable to write the block labels in the notebook in which 
the original results are recorded. 

Example. The example is a triple rectangular lattice for 12 treatments 
in blocks of З (k = 3). Artificial data were assembled by taking true 
treatment effects as shown in table 10.6, and adding to them true block 
effects as given with each block in table 10.7. Thus the observation 7 


TABLE 10.6 TREATMENT EFFECTS IN ARTIFICIAL DATA FOR A 3 X 4 
TRIPLE RECTANGULAR LATTICE 


True treatment 
Treatment True effect total 
1 14 56 
2 7 35 
3 2 20 
4 0 14 
5 3 23 
6 11 47 
1 9 4 
8 16 62 
9 8 38 
10 T 17 
п 5 29 
12 6 32 
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for treatment (10) in block Х4 of table 10.7 is obtained by adding the 
true treatment effect, 1, to the true block effect, 6. Since the average of 
the true block effects comes out to be 56/12, or 14/3, the true treatment 
totals, over the 3 replicates, will be (3r + 14), where т is the true treat- 


TABLE 10.7 PLAN AND OBSERVATIONS FOR А 3 X 4 RECTANGULAR LATTICE 


(Treatment numbers are enclosed in parentheses) 


Block True block Totals Adjustment 

symbol effect. Rep. X B Cx factor 
X4 6 (10) 7 (12) 12 (11) 11 30 -12 -0.6 
Х\ 2 (2) 9 (3 4 (1)16 29 20 +34 
xa 7 (7) 16 (9 15 (8 23 54 -5 —1.6 
X2 0 (4) 0 (5) з (611 14 30 +54 

127 33 

Вер. У В Су 
уа 9 (311 (6) 20 (917 48 —32 -46 
Y2 3 (1) 17 (1) 8 (9 19 44 10 +14 
ҮЗ 3 (12) 9 (2)10 (5) 6 25 10 +14 
Y1 5 (10) 6 (4) 5 (7) 14 25 0 —0.6 

142 -12 

Бер. 2 В Cz 
21 4 (8 20 (6) 15 (12) 10 45 4 +02 
22 8 (9) 16 (10) 9 (2) 15 40 —16 —3.8 
23 1 а) 6 (98 (4) 1 10 19 +32 
24 8 (бп (022 (017 50 —28 -38 

145 -21 

Treatment totals 


l 2 9 4 "5 6—7 8 9 10 11.12 


Unadj. 55 34 18 6 20 46 47 62 48 22 25 31 
Adj. 56 35 20 14 23 47 41 62 38 17 29 32 


ment effect. "These figures are shown in the column at the right in table 
10.6. 

No intra-block error has been introduced into the data, which there- 
fore provide two checks on the method of analysis. First, in the analysis 
of variance the intra-block error s.s. should be found to be identically 
zero. Second, the adjusted treatment totals as found from the analysis 
should be exactly equal to the true treatment totals. 

The computing instructions given below apply to either the simple or 
the triple rectangular lattice without repetitions. The changes required 
when there are repetitions are given later. 
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1. Find the block totals, B, the replication totals, the treatment totals, 


and the grand total. 
2. For each block calculate 


С = total (over all replicates) of all treatments in the block — rB 


Thus for block У4, 
С = 18 + 46 + 48 — (3) (48) = —32 
As usual, the total of the C’s is zero. 
3. Arrange the C values in a supplementary table so that partners ap- 
pear in the same row. The row totals of this table, S, give the sums of 
the C values for each set of partners. The column totals give values 


denoted by Re. 


TABLE 10.8 SUPPLEMENTARY TABLE OF C VALUES 


Block 
symbol Cx = С СЕ Mx My Mz uS 
1 20 0 4 24 4.0 0.0 0.8 0.6 
2 30 10 —16 24 6.0 2.0 —3.2 0.6 
3 -5 10 19 24 -1.0 2.0 3.8 0.6 
4 -12 -32 -28 -72 -2.4 —6.4 -5.6 —1.8 
Totals (Ес) 33 -12 — 0 6.6 —2.4 —4.2 0.0 


4. The analysis of variance is now obtained. АП sums of squares are 
found in the usual way, except that for blocks, adjusted for treatments, 


which is 
БЖ С? 25 Re % 852 
r(rk — k— 1) " r(k + (rk — k — 1) x r(r — 1)(k + D) (rk — k — 1) 
(20)? 4- (30)? +.--+ (28)? (33)?+(12)?-+ (21)? ы. (24)?-+-+-+(72)? 


15 60 120 
= 274.0 — 27.9 — 57.6 = 188.5 (10.4) 
TABLE 10.9 ANALYSIS OF VARIANCE 
Source of variation d.f. 8.8. m.s. 
Replications (r — 1) 2 15.5 
Treatments (k +k- 1 11 1067.0 
Blocks rk 9 188.5 20.9Еь 
Intra-block error (--1)8-1)-Е 13 0.0 0.0Е, 
Total (ri? + rk — 1) 35 12710 


As anticipated, the intra-block error 8.8. in table 10.9 is zero. 
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5. We now calculate the weighting factors used to obtain the adjusted 
treatment totals. Although a general formula can be given, it is simpler. 
to present these separately for simple and triple designs. Two weighting 
factors are necessary.* 1 


Simple rectangular lattice m 
м т(Еь — Е.) a 
ҚҰ- DE, + (rk — 2k + HE, ао 
M(E, — E.) X «| 
UU WERDE + (k= 2k— 5E diam O 
Triple rectangular lattice т 
"Е, ей Е.) 
Ае----------- $ 
r(2k — 1)Еь + (rk — 3k + r)E, ao 
Х"(Еь — Е, M 
КЮ = Е) (10.8) 


КО Е DE, + (8 ЗЕ С E, 1430 
In this example Е, is zero and we have 
0.2)(3 
а ты OT 
(3) (5) (2)(3) (4) 

6. Complete table 10.8 by adding the columns ACx, ACy, ACz; and 

uS. The adjustment for any block is 
AC — uS a 
where S is taken from the row in which C lies in table 10.8. Thus for 
block X1 the adjustment is 
+4.0 — 0.6 = +3.4 


These adjustments are recorded on the plan in table 10.7. Мое that 
the order in which blocks appear is different in tables 10.7 and 10.8; care 
must be taken to see that each adjustment is given to the appropriate 
block. 

7. Finally, each treatment total is adjusted by adding the adjustments 
for every block in which the treatment appears. For instance, the | 
adjusted total for treatment (4) is 


6 + 5.4 — 0.6 + 3.2 = 14 


* This method of estimating the weights is slightly different from that given by 
Harshbarger. In the interests of simplicity we have used only the pooled mean 
square for blocks and the intra-block error m.s. for estimating ХА and и. 
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Every adjusted treatment total will be found to be equal to the true 
treatment total as given in table 10.6. 

Statistical analysis for repetitions of the designs, When the basic design 
with n replications is repeated p times to give r = np replications, the 
changes required in the steps of the analysis are noted below. 

1. No change. 

2a. For any block there are (p — 1) other blocks that have exactly 
the same set of treatments. Arrange the totals for such similar blocks 
in two-way tables, as exemplified below. There are n such tables, 


X Blocks 
Repetition Total 
Block no. 12... р (= B) 
1 
2 
(0 
Total 


2b. For each group of similar blocks calculate x 

C = total (over all replicates) of all treatments in the group — n() 8) 
3. No change. 

4a. The blocks s.s. now has two components. For component (a) ob- 
tain the interaction s.s. for each of the tables in step 2a, and add these 
sums of squares. Since each block total contains Ё observations, a 
divisor k is required for the analysis of variance. 

4b. The sum of squares for component b is 


» L Ве P 25% 
ПАБЕ) rk+1)(nk—k—1) қа Е Ев 1) 


The separation of degrees of freedom in the analysis of variance is shown 
below. 


d.f. m.s. 
Replications CHD 
Treatments +k- 1) 
Blocks rk Er 
Component (a) (- nk 


Component (b) nk 
Intra-block error (r—D(?—-)-k E 


Total (rk? 4- rk — D 
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5. The weighting factors in formulae (10.5)-(10.8) are unchanged. 
Note that Es is the pooled mean square for blocks. 

6. Unchanged. 

7. Unchanged. 

As mentioned previously, in order to have t-tests for every pair of 
treatments we would require to present 4 standard error formulae for 
simple rectangular lattices and 7 for triple rectangular lattices. With 
very little inaccuracy these can be reduced to 2, one for the case where 
the two treatments appear in a block and one for the case where this is 
not so. For the difference between two adjusted means the estimated 
error variances are: 

Simple rectangular lattice 


2Е, 
Two treatments in the same block: TE (1 +A — p) 


Two treatments not in same block: = (1 + 24 — и) 


„„2Е,[ 2% — 1) а -к+1 
Average: [PEE ажат 1049-0 
Б bs] 
uae (k? +k -1) 


Triple rectangular lattice 
Two treatments in the same block: E 4-23 — u) 


Two treatments not in same block: 2. (1 +3 — 2") 


|, Bef 3—1). K- 2 +2 3 
ыы Up [Р т-та Еа БЕТІ (а ДВА -291 

In the example, where А = 0.2, и = 0.025, the factors multiplying the 
usual term 2E,/r are 1.375 for two treatments in the same block and 
1.562 for two treatments not in the same block. The factor in the aver- 
age variance is 1.460. 


10.4 Cubic Lattices 


10.41 Description. These designs were produced by Yates for plant- 
breeding work in which selections are to be made from an unusually large 
number of varieties. The number of treatments must be an exact cube. 

* Owing to the fact that only two standard errors were used, this average is not 


quite equal to the average variance taken over all possible pairs, though it is very 
close to tha* value. 
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The most useful range comprizes 27, 64, 125, 216, 343, 512, 729, and 
1000 treatments. The size of block is the cube root of the number of 
treatments, i.e., 3, 4, 5, 6, 7, 8, 9, and 10, respectively. Thus cubic lat- 
tices can accommodate a large number of treatments in a small size of in- 
complete block. The designs have been used, for example, in an ex- 
periment with 729 strains of ponderosa pine seedlings (10.9) and an ex- 
periment with 729 soybean varieties (10.10). The number of replicates 
must be 3 or some multiple of 3. 

Since the plans are easy to construct and since they occupy a соп- 
siderable amount of space for the higher numbers of treatments, they are 
not reproduced here. To obtain a plan, the K? treatments are numbered 
by means of a three-digit code in which each digit takes all values from 
1 tok. For 27 treatments, the code is as follows: 


T* Code T Code T Code 
1 11 4 121 75! 181 
2 211 5 221 8 231 
3 311 6 321 9 331 

10 112 13 122 16 182 
11 212 14 222 17 292 
12 312 15 322 18 332 
19 113 22 128 25 133 
20 213 23 223 26 283 
21 313 24 323 27 333 


* Treatment number. 


The same principle applies with a larger number of treatments. For 
the first k treatments, the last two digits are fixed at (11) while the first 
digit runs from 1 to k. The next k treatments are coded by fixing the 
last two digits at (21) while the first digit again runs from 1 to k, and so 
on in a systematic manner until the final k treatments are reached, for 
which the last two digits have the fixed values (kk). 

Within each of the 3 replications, the F? treatments are grouped into 
k? blocks, each of size k. In the first replication, the rule for this group- 
ing is to keep the last two digits constant within a block, allowing the 
first digit to take all values from 1tok. Thus, in the example above, the 
9 groups of treatments constitute the 9 blocks, block 1 containing the 
treatments (111), (211), and (311). 

To form blocks in the second replication, we keep the first and last 
digits fixed within any block and give the second digit all values from 
1 to k. With 27 treatments, the first block therefore contains (111), 
(121), and (131), the second block (211), (221), and (231), and the last 
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block (313), (323), and (333). In the third replication, the first and 
second digits are constant within each block. 
The composition of the blocks in the second and third replications is 


shown in table 10.10. 


TABLE 10.10 SECOND AND THIRD REPLICATES OF A CUBIC LATTICE WITH 
27 TREATMENTS 


Block (1) 
Code T * 


Hio 
121 4 
13 4 


(6) 
Code T 


312 12 
322 15 
332 18 


Block (1) 
Code T 


I E 
112 10 
113 19 


(6) 
Code T 


321 6 
322 15 
323 24 


* Treatment number. 


ы 
= 
= 

oan N 


(2) 
Code T 
21 2 
212 11 
213 20 


(7) 
Code Т 
131 7 
132 16 
133 25 


Rep. II 
(3) (4) 
Code Т Code Т 
зи 3 112 10 
321 6 122 13 
331 9 132 16 
(8) (9) 


Rep. III 


(3) (4) 
Сое Т Сое Т 
311 3 121 4 
312 12 122 13 
313 21 123 22 


(8) (9) 
Code T Code T 
231 8 331 9 
232 17 332 18 
233 26 333 27 


(5) 
Code Т 
212 11 
222 14 
232 17 


(5) 
Code Т 
221 5 
22 14 
223 23 


The original treatment numbers are shown as well as the code num- 
bers. In an experiment it is simplest to record only the code numbers, 


which are required in order to follow the computing instructions. 


Cubic lattices may be expected to be most useful when the number of 
treatments exceeds 100. The designs for k = 3, 4, 5, and 6 lie in the 
range which is also covered by lattice designs. We do not know of any 
investigations of the relative accuracy of lattice and cubic lattice designs 
in these cases. With highly variable experimental material, the use of a 
smaller block might give the cubic lattice some advantage; on the other 
hand the statistical analysis is more laborious owing to the greater num- 
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ber of block adjustments to be calculated. Like the lattice designs, 
cubie lattices cannot be appreciably less accurate than randomized 
blocks which occupy the same set of replications, and can be analyzed 
by the method for randomized blocks. 

Some cubic lattices can also be arranged in quasi-latin squares (10.11) 
so as to allow the elimination of two sources of error variation. 


10.42 Arrangement of Experimental Material. The most important 
rule is $0 have units in the same incomplete block as homogeneous as the 
experimental material permits; in field trials the blocks should be com- 
pact in shape. If practicable, blocks within the same replicate should 
also be similar to each other, so as to increase the accuracy of inter-block 
comparisons and to obtain more precise results if the analysis is sub- 
sequently carried out by the method for randomized blocks. 


10.43 Randomization. The steps are: 

1. Randomize the order of the blocks independently within each rep- 
lication. 

2. Randomize the positions of the treatment code numbers inde- 
pendently within each block. 

3. Assign treatments to code numbers at random. 


10.44 Statistical Analysis. The following reference should be con- 
sulted. 


(10.12) Үлтев, Е. The recovery of inter-block information in variety 
trials arranged in three-dimensional lattices. Ann. Eugen. 9, 
136-156, 1939. 


General formulae are given with a numerical example of the analysis for 
an experiment with 64 treatments. In following this example, the 
reader should note that the experiment, was not arranged in separate 
replications; the changes introduced for this reason are pointed out by 
Yates (рр. 147-148). The method of analysis when 6 or 9 replicates are 
used is also indicated (р. 144). Reference (10.13) describes the analysis 
of an experiment with 125 treatments. 


10.46 Error Variances. The formulae for the error variance of the 
difference between two treatment means are summarized below. In the 
analysis as described by Yates (10.12), the symbols 
ш = ш ш = w 
д інін жер ee 
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play a prominent part. It is convenient to write M = А/Ё?, ш = p/k? 
The error variances for the different types of treatment comparisons are 


as follows: 
Comparison Variance of difference between means 

2Е, 

ini — іш Tu [1 + 2” + 2(k — 241 
2E. 

tiz — Ви p [1 + 4' + (8k — 9,1 
2Е, 

бз — (и == [1 + 60’ + 3(k — 2,7 
2E, № Я В 

жены os АВЕО +80 nen] 


Here Е, is the intra-block error m.s. and г the number of replicates 
(3, 6, or 9). The formulae, though written in a slightly different form, 
are identical with those given by Yates. Our k corresponds to Yates's 
р, and our т to his n. 


10.1 


10.2 
10.3 


10.4 
10.5 


10.6 
10.7 
10.8 
10.9 


10.10 
10.11 


10.12 


10.13 
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PLANS 
Plan 10.1 3 X 3 balanced lattice 
1= 9, к = 3,т = 4, = 12, А = 1* 
Block Rep. І Rep. П Rep. III Rep IV 
(1) 1 2 3 (4) 147 () 159 (10) 186 
(2) 456 (5) 258 (8) 7 2 6 (1) 429 
() 789 (0 3 6 9 (9) 4 8 3 (12) 7 5 8 
Plan 10.2 4 X 4 balanced lattice 
t=16,k = 4,7 = 5, = 20, А = 1 
Block Rep. I Rep. II Rep. Ш 


122-5245. (0 1 8 9 18 (9 1 6 i 16 
(2) (0 2 6 10 14 (0) 5 2 15 12 
@ 9101112 9 3 71 15 (0) 914 3 8 
(0 18 14 15 16 (8 4 8 12 16 (12) 13 10 7 4 


= 
o 
= 
© 


Rep. IV Rep. V 
а) 1 14 7 12 (7 1 10 15 8 
(14) 13 211 8 (18) 9 2 7 16 
15 5 10 3 16 (19) 13 6 3 12 
(16 9 6 15 4 (0 5141 4 


Plan 10.8 5 X 5 balanced lattice 
t= 25,6 = 5,т = 6, =30,A = 1 
Block Rep. I Rep. II Бер. Ш 


(0 12345 (9 1 61 1 31. a) 1 7 1 19 3 
@ 6 7 8 91 (7 2712122 (2 21 2 8142 
(3 11 12 13 14 15 (8 3 8 13 18 28 (13) 16 22 3 915 
(4) 16 17 18 19 20 (9 4 9 14 10 24 (14 11 17 28 410 
(5) 21 22 23 24 25 (10 5 10 15 20 25 (15) 6 12 18 ^ 5 


Rep. IV Rep. Y Rep. VI 
(16) 1 12 23 9 20 (2) 1 17 8 24 15 (26) 1 22 18 14 10 
00 16 2 13 24 10 (22) П 2 18 9 25 (2) 6 2 23 19 15 
(48) 6 17 3 14 25 (23) 21 12 3 19 10 Qs) di 7 3 24 20 
09 21 7 18 4 15 (м) 6 22 13 4 20 (29) 16 12 8 4 25 
00 П 22 8 19 5 (916 T7 28 M 5 (x) 2n 17 13 9 5 


* The symbol A denotes the number of times that two treatments appear in the 
same block. 
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Plan 10.4 7 X 7 balanced lattice 
1249 Е =7,r=8b=56,. =1 

Block Rep. I Rep. П 
123402007 (8 1 8 15 22 29 36 43 
(2 8 9 оп 12 13 М (9 2 9 16 23 30 37 44 
(3 15 16 17 18 19 20 21 (10 8 10 17 24 31 38 45 
(4) 22 23 24 25 26 27 28 (1) 4 11 18 25 32 39 46 
(5 29 30 31 32 33 34 35 (12) 5 12 19 26 33 40 47 
(6) 36 37 38 39 40 41 42 (13) 6 13 20 27 34 41 48 
(7) 43 44 45 46 47 48 49 (14 7 14 21 28 35 42 49 

Rep. III Rep. IV 
(15 1 9 17 25 33 41 49 (22) 1 37 24 11 47 34 21 
(16) 43 2 10 18 26 34 42 (23) 15 2 38 25 12 48 85 
(17) 36 44 3 11 19 27 35 (24) 29 16 3 39 26 13 49 
(18) 29 37 45 4 12 20 28 (25) 43 30 17 4 40 27 14 
(19) 22 30 38 46 5 18 21 (26) 8 44 81 18 541 28 
(20 15 23 31 39 47 6 14 (27) 22 9 45 32 19 6 42 
(21) 8 16 24 32 40 48 7 (28) 36 23 10 46 33 20 7 

Rep. V Rep. VI 
(29) 1 30 10 39 19.48 28 (36) 1 23 45 18 40 13 35 
(30) 22 2 81 11 40 20 49 (37 29 224 46 19 41 14 
(31) 43 23 3 32 12 41 21 (38) 8 30 3 25 47 20 42 
(32) 15 44 24 4 33 13 42 (39) 36 931 4 26 48 21 
(33) 36 16 45 25 5 34 14 (40) 15 37 10 32 5 27 49 
(34 8 37 17 46 26 6 35 (41) 43 16 38 11 6 28 
(35) 29 9 38 18 47 27 7 (42) 22 44 17 39 12 34 7 

Вер. ҮП Вер. ҮШ 
(43) 1 16 31 46 12 27 42 (50) 1 44 38 32 26 20 14 
(44 36 2 17 32 47 13 28 (5) 8 2 45 39 33 27 21 
(45) 22 37 3 18 33 48 14 (52) 15 9 3 46 40 34 28 
(46) 8 23 88 4 19 34 49 (53) 22 16 10 4 47 41 35 
(47) 43 9 24 39 5 20 35 (4) 29 23 17 11 5 48 42 
(48) 29 44 10 25 40 6 21 (55) 36 30 24 18 12 6 49 
(49) 15 30 45 11 26 41 7 (56) 43 37 31 25 19 13 7 
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Plan 10.5 8 X 8 balanced lattice 
(= 6, =8,т =9,Ь = 72, А = 1 

Block Rep. I Rep. II 
lo ie rei aL (9 1 9 17 25 33 41 49 57 
@ 9 10 11 12 13 14 15 16 (0) 2 10 18 26 34 42 50 58 
(3) 17 18 19 20 21 22 23 24 (1) 3 П 19 27 35 43 51 59 
(4 25 26 27 28 29 30 31 32 (12 `4 12 20 28 36 44 52 00 
(5) 33 34 35 36 37 38 39 40 (13) 5 13 21 29 37 45 53 61 
(6) 41 42 43 44 45 46 47 48 (14) 6 14 22 30 38 46 54 62 
(7) 49 50 51 52 53 54 55 56 (15) 7 15 23 31 39 47 55 63 
(8 57 58 59 60 61 62 63 64 (16) 8 16 24 32 40 48 56 64 

Rep. Ш Кер. ТУ 
(17) 110 19 28 37 46 55 64 (25) 1 18 27 44 13 62 39 56 
(8) 9 2 51 44 бі 30 23 40 (26) 17 2 35 60 53 46 31 16 
(19) 17 50 3 36 29 62 15 48 (27) 25 34 3 12 45 54 23 64 
(20) 25 42 35 4 21 14 63 56 (28) 41 58 11 4 29 22 55 40 
(21) 83 58 27 20 5 54 47 16 (29) 9 50 43 28 5 38 63 24 
(22) 41 26 59 12 53 639 24 (30) 57 42 51 20 37 6 15 32 
(23) 49 18 11 60 45 38 7 32 (81) 83 26 19 52 61 14 7 48 
(24) 57 34 43 52 13 22 31 8 (32) 49 10 59 36 21 30 47 8 

Rep. V Rep. VI 
(33) 1 26 43 60 21 54 15 40 (41) 1 34 11 20 53 30 63 48 
(34) 25 2 11 52 37 62 47 24 (42) 83 2 59 28 45 22 15 56 
(85) 41 10 3 20 61 38 31 56 (43) 9 58 3 52 21 46 39 32 
(36) 57 50 19 4 45 30 39 16 (44) 17 26 51 4 13 38 47 64 
(37) 17 34 59 44 5 14 55 32 (45) 49 42 19 12 5 62 31 40 
(38) 49 58 35 28 13 6 23 48 (46) 25 18 43 30 61 6 55 16 
(39) 9 42 27 36 53 22 7 64 (47) 57 10 35 44 29 54 7 2 
(40) 33 18 ol 12 29 46 63 8 (48) 41 50 27 60 37 14 23 8 
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Plan 10.5 (Continued) 8 X 8 balanced lattice 

Rep. VII Rep. VIII 
(49) 1 42 59 52 29 38 23 16 (57) `1 50 35 12 61 22 47 32 
(50) 41 2 19 36 13 54 63 32 (58) 49 2 43 20 29 14 39 64 
(51) 57 18 3 28 53 14 47 40 (50) 33 42 3 60 13 30 55 24 
(52) 49 34 27 4 61 46 15 24 (60) 9 18 59 4 37 54 31 48 
(53) 25 10 51 60 5 22 39 48 (61) 57 26 11 36 5 46 23 56 
(54) 33 50 11 44 21 6 81 64 (62) 17 10 27 52 45 6 63 40 
(55) 17 58 43 12 37 30 7 56 (63) 41 34 51 28 21 62 7 16 
(56) 9 26 35 20 45 62 55 8 (64) 25 58 19 44 53 38 15 8 

Rep. ІХ 
(65) 1 58 БІ 36 45 14 31 24 
(66) 57 2 27 12 21 38 55 48 
(67) 49 26 3 44 87 22 63 16 
(68) 33 10 43 4 53 62 23 32 
(69) 41 18 35 52 5 30 15 64 
(70) 9 34 19 60 29 6 47 56 
(71) 25 50 59 20 18 46 7 40 
(72) 17 42 11 28 61 54 39 8 
Plan 10.6 9 X 9 balanced lattice 

t =81,k =9,r =10,b = 90,4 =1 

Block Rep. I Rep. II 
() 118456789 (10) 110 19 28 37 46 55 64 73 
(2) 10 11 12 13 14 15 16 17 18 (1) 211 20 29 38 47 56 65 74 
(3) 19 20 21 22 23 24 25 26 27 (12) 3 12 21 30 39 48 57 66 75 
(4) 28 29 30 31 82 33 34 35 36 (13) 413 22 31 40 49 58 67 76 
(5) 37 38 39 40 41 42 43 44 45 (14) 5 14 23 32 41 50 59 68 77 
(6) 46 47 48 49 50 51 52 53 54 (15) 615 24 33 42 БІ 60 69 78 
(7) 55 56 57 58 59 60 61 62 63 (16) 7 16 25 34 43 52 61 70 79 
(8) 64 65 66 67 68 69 70 71 72 (1) `8 17 26 35 44 53 62 71 80 
(9 73 74 75 76 77 78 79 80 81 (18 9 18 27 36 45 54 63 72 81 

Rep. III Rep. IV 
(19) 120 12 58 77 69 34 53 45 (28) 1 11 21 31 41 51 61 71 81 
(20) 10 2 21 67 59 78 43 35 54 (29) 19 2 12 49 32 42 79 62 72 
(21) 19 11 3 76 68 60 52 44 36 (30) 10 20 3 40 50 33 70 80 63 
(22) 28 47 39 4 23 15 61 80 72 (31) 55 05 75 4 14 24 34 44 54 
(23) 37 29 48 13 5 24 70 62 81 (32) 73 56 66 22 5 15 52 35 45 
(24) 46 38 30 22 14 6 79 71 63 (33) 64 74 57 13 23 6 43 53 36 
(25) 55 74 66 31 50 42 7 26 18 (34) 28 38 48 58 08 78 7 17 27 
(26) 64 56 75 40 32 51 16 8 27 (35) 46 29 39 76 59 69 25 8 18 
(27) 73 65 57 49 41 33 25 17 9 (36) 37 47 30 67 77 60 16 26 9 
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Plan 10.6 (Continued) 9 X 9 balanced lattice 

Rep. V Rep. VI 
(87) 1 29 57 22 50 78 16 44 72 (46) 1 56 30 13 68 42 25 80 54 
(38) 55 2 30 76 23 51 70 17 45 (47) 28 2 57 40 14 69 52 26 81 
(89) 28 56 3 49 77 24 43 71 18 (48) 55 29 3 67 41 15 79 53 27 
(40) 10 38 66 4 32 60 25 53 81 (49) 19 74 48 4 59 33 16 71 45 
(41) 64 11 39 58 5 33 79 26 54 (50) 26 20 75 31 5 60 43 17 72 
(42) 37 65 12 31 59 6 52 80 27 (51) 73 47 21 58 32 6 70 44 18 
(43) 19 47 75 13 41 69 7 35 63 (52) 10 65 39 22 77 51 7 62 36 
(44) 73 20 48 67 14 42 61 8 36 (53) 37 11 66 49 23 78 34 8 63 
(45) 46 74 21 40 68 15 34 62 9 (54) 04 38 12 76 50 24 61 35 9 

Rep. VII Rep. VIII 
(55) 1 47 66 76 14 33 43 62 27 (64) І 74 39 67 32 24 52 17 68 
(56) 64 2 48 31 77 15 25 44 63 (65) 37 2 75 22 68 33 61 53 18 
(57) 46 65 3 18 32 78 61 26 45 (66) 73 38 3 31 23 69 16 62 54 
(58) 37 56 21 4 50 69 79 17 36 (67) 46 11 57 4 77 42 70 35 27 
(59) 19 38 57 67 5 БІ 34 80 18 (68) 55 47 12 40 5 78 25 71 36 
(60) 55 20 39 49 68 6 16 35 81 (69) 10 56 48 76 41 6 34 26 72 
(61) 78 11 30 40 59 24 7 53 72 (70) 64 29 21 49 14 60 7 80 45 
(62) 28 74 12 22 41 60 70 8 54 (71) 19 65 30 58 50 15 43 8 81 
(63) 10 29 75 58 23 42 52 71 (72) 28 20 66 13 59 51 79 44 9 

Rep. IX Rep. X 
(73) 165 48 4С 23 60 79 35 18 (82) `1 38 75 49 59 15 70 26 36 
(74) 46 2 66 58 41 24 16 80 36 (83) 73 2 39 13 50 60 34 71 27 
(75) 64 47 3 22 59 42 34 17 81 (84) 37 74 3 58.14 51 25 35 72 
(76) 73 29 12 4 68 51 43 26 63 (85) 64 20 30 4 41 78 52 62 18 
(7) 10 74 30 49 5 69 61 44 27 (86) 28 65 21 76 5 42 16 58 63 
(78) 28 11 75 67 50 6 25 62 45 (87) 19 29 66 40 77 6 61 17 54 
(79) 37 20 57 76 32 15 7 71 54 (88) 46 56 12 67 23 33 7 44 8l 
(80) 55 38 21 13 77 33 52 8 72 (89) 10 47 57 31 68 24 79 8 45 
(81) 19 56 39 31 14 78 70 53 9 (90) 55 11 48 22 32 69 43 80 9 
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Plan 10.7 6 X 6 triple lattice 
Block Rep. I Rep. П 
@ 1 2 3.4455 16 ( 1 7 18 19 25 31 
(2 7 8. 951004 42 (8 2 8 14 20 26 32 
(3 13 14 15 16 17 18 (9) 3 9 15 21 27 38 
(4) 19 20 21 22 23 24 (10 4 10 16 22 28 34 
(5) 25 26 27 28 29 30 (1) 5 n 17 23 29 35 
(6) 31 32 33 34 85 36 (12) 6 12 18 24 30 36 

Вер. Ш 

(13) 1 8 15 22 29 36 
(14) 31 2 9 16 23 30 
(15) 25 32 3 10 17 24 
(16) 19 26 33 4 11 18 
(17) 13 20 27 34 5 12 
(18) 7 14 21 28 35 6 

Plan 10.8 10 X 10 triple lattice 

Block Rep. I Rep. II 


(0123456789 10 


(11) 1 11 21 31 41 51 61 71 81 91 


(2) 11 12 18 14 15 16 17 18 19 20 
(3) 21 22 23 24 25 26 27 28 29 30 
(4) 31 32 33 34 35 36 37 38 39 40 
(5) 41 42 43 44 45 46 47 48 49 50 
(6) 51 52 53 54 55 56 57 58 59 60 
(7) 61 62 63 64 65 66 67 68 69 70 
(8 71 72 73 74 75 76 77 78 79 80 
(9) 81 82 83 84 85 86 87 88 89 90 


(12) 2 12 22 32 42 52 62 72 82 92 
(13) 3 13 23 33 43 53 63 73 83 93 
(14) 4 14 24 34 44 54 64 74 84 94 
(15) 5 15 25 35 45 56 65 75 85 95 
(16) 6 16 26 36 46 56 66 76 86 96 
(17) 71727 37 47 57 67 77 87 97 
(18) 8 18 28 38 48 58 68 78 88 98 
(19) 9 19 29 39 49 59 69 79 89 99 


(10) 91 92 93 94 95 96 97 98 99 100 


Rep. Ш 
(21) 1 12 23 34 45 56 07 78 89 100 
(22) 91 2 13 24 35 46 57 68 79 90 
(23) 81 92 3 14 25 36 47 58 69 80 
(24) 71 82 93 4 15 26 37 48 59 70 
(25) 61 72 83 94 5 16 27 38 49 60 
(26) 51 62 73 84 95 6 17 28 39 50 
(27) 41 52 63 74 85 96 7 18 29 40 
(28) 31 42 53 64 75 86 97 8 19 30 
(29) 21 32 43 54 65 76 87 98 9 20 
(30) 11 22 33 44 55 66 77 88 99 10 


(20) 10 20 30 40 50 60 70 80 90 100 
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Plan 10.9 


Block 


(1) 
(2) 
(3) 
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12 X 12 quadruple lattice 


Rep. I 
ШОМ S 34 6.47 48^ 9 10 d 12 
is idee 15 УИ 18 410 50020 52 73 . 24 
25 Ho0M 22 1985199 30 81 1325 33. 34 25 36 
37 38 39 40 41 42 43 44 45 46 47 48 
49 50 51 52 53 54 55 56 57 58 59 60 
61 62 63 64 65 66 67 68 69 70 71 72 
УЗ to: Иб 10 US 19 80,81 82 № M 
8 86 87 88 80 90 91 92 93 94 95 96 
97 98 99 100 101 102 103 104 105 106 107 108 
109 110 111 112 113 114 115 116 17 118 119 120 
121 122 123 124 125 126 127 128 129 130 131 132 
133 134 135 136 137 138 139 140 141 142 143 144 

Вер. П 
І 18 25 37 49 61 73 85 97 109 121 133 
2 14 26 38 50 62 74 86 98 10 122 134 
78 15 27 39 51 63 75 87 99 111 123 135 
4 16 28 40 52 64 76 88 100 112 124 136 
75 17 29 41 53 65 77 89 101 113 125 137 
6 18 30 42 54 66 78 90 102 114 126 138 
7 19 31 43 55 67 79 91 108 115 127 139 
8 20 32 44 56 68 80 92 104 116 128 140 
9 21 33 45 57 69 81 93 105 117 129 141 
10 22 34 46 58 70 82 94 106 118 130 142 
П 23 35 47 59 71 88 95 107 119 131 143 
12 24 36 48 60 72 84 96 108 120 132 144 

Rep. III 
1 14 27 40 57 70 83 96 101 114 127 140 
2 13 28 39 58 60 54 95 102 113 128 139 
3 16 25 38 59 72 81 94 103 116 125 138 
4 15 26 37 60 71 82 93 104 115 126 137 
5 18 31 44 49 62 75 88 105 118 131 144 
6 17 32 43 50 61 76 87 106 117 132 143 
7 20 29 42 51 64 73 86 107 120 129 142 
8 19 30 41 52 63 74 85 108 119 130 141 
9 22 35 48 53 66 79 92 97 110 123 136 
10 21 36 47 54 65 80 91 9 109 124 135 
П 24 33 46 55 68 77 90 99 112 121 134 
12 23 34 45 56 67 78 89 100 111 122 133 
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Plan 10.9 (Continued) 12 X 12 quadruple lattice 
Rep. IV 
(87) 1 24 30 43 53 64 82 95 105 116 122 135 
(39 2 23 29 44 54 63 81 96 106 115 121 136 
(39) з 22 32 41 55 62 84 93 107 114 124 133 
(40) 4 21 31 42 56 61 83 94 108 113 123 134 
(41) 5 16 34 47 57 68 74 87 97 120 126 139 
(42) 6 15 33 48 58 67 73 88 98 119 125 140 
(43) 7 14 36 45 59 66 76 85 99 118 128 137 
(44) 8 13 35 46 60 65 75 86 100 117 127 138 
(45) 9 20 26 39 49 72 78 91 101 112 130 143 
(6) 10 19 25 40 50 71 77 92 102 111 129 144 
(7) 11 18 28 37 51 70 80 89 103 110 132 141 
(48) 12 17 0 38 52 69 79 90 104 109 131 142 
Plan 10.10 ЗХ 4 rectangular lattice 
Block Rep. X Rep. У Rep. Z 
Xi 1 $2. 8 үҮї 4 Bie 21 6 8 12 
Х2 4 5 6 Ya! SH 222 9 10 
xe 7T 8 9 ҮЗ 2 5 12 23 3 41 
X4 10 11 12 ys 3 6 9 Z4 1 5 Т 
Plan 10.11 4 X 5 rectangular lattice 
Block Rep. X Rep. Y Rep.Z 
Жі 1 23:4 Yi 5 9 19.17 Zi 8 П 15 18 
X9 546 8 y2 1 10 14 18 22 2 9 16 2 
X3 9 10 11 12 732 6:16; 019 28 4. 714,17 
X4 13 14 15 16 ү4-:8 751420 24. 1, 5 12 19 
X5 17 18 19 20 Ү5 4 8 12 16 25 3 6 10 13 
Plan 10.12 5 X 6 rectangular lattice 
Block Rep. X Rep. У Rep. 2 
Xi 1.2 3 4 8 yi 6 11 16 21 26 Zi 7 13 19 25 27 
x2 6 7 8 9 10 y2 1 12 17 22 27 Z2 5 14 16 23 29 
X8 11 12 13 14 15 Y3 2 T7 18 23 28 Z3 1 8 20 21 80 
X4 16 17 18 19 20 YA 3 8 13 24 29 Z4 2 9 15 22 26 
X5 21 22 23 24 25 y5 4 9 14 19 30 Z5 3 10 11 17 28 
X6 26 27 28 29 30 Yo 5 10 15 20 25 Z6 4 6 12 18 24 
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Plan 10.13 6 X 7 rectangular lattice 
Block Rep. X Rep. У 
ОТТАРЫ SO qe беті Yl. 7 13 19 25 31 97 
ЕН Ча Yo 1.14 20 26 32 38 
ХЗ 13 М 15 16 17 18 Ya 2 8 21 27 33 99 
X4 19 20 21 22 23 24 YA 3 9 15 28 34 40 
X5 25 26 27 28 29 30 Y5 4 10 16 22 35 41 
X6 31 32 33 34 35 36 Үб 5 11 17 23 29 42 
X7 37 38 39 40 41 42 Y7 6 12 18 24 30 36 
Rep.Z 
Z1 12 17 22 28 33 38 
Z2 2 18 24 29 35 40 
Z8 4 9 20 25 36 42 
Z4 6 11 16 27 32 37 
Z5 1 7 18 23 34 39 
Z6 83 8 14 19 30 41 
27 5 10 15 21 26 31 
Plan 10.14 7 X 8 rectangular lattice 
Block Rep. X Rep. Y 
ЖІ; 17259 М 57 6970 үй) 8516722 29 86 43 50 
X2 8 9 10 11 12 13 14 Y2 1 16 23 30 37 44 51 
ХЗ 15 16 17 18 19 20 210 уз 2 9 24 31 38 45 52 
X4 22 23 24 25 26 27 28 Y4 3 10 17 32 39 46 58 
X5 29 30 31 32 33 34 35 Y5 4 11 18 25 40 47 54 
X6 36 37 38 39 40 41 42 Y6 5 12 19 26 33 48 55 
X7 48 44 45 46 47 48 49 үт 6 13 20 27 34 41 56 
X8 50 51 52 53 54 55 56 Y8 7 14 21 28 35 42 49 
Rep.Z 
Z1 9 17 25 33 41 49 БІ 
Z2 7 18 26 34 38 43 53 
Z3 1 10 27 35 36 47 55 
Z4 2 11 19 29 42 44 56 
Z5 3 12 20 28 37 45 50 
26 4 18 21 22 30 46 52 
27 5 14 15 23 31 39 54 
Z8 6 8 16 24 32 40 48 


PLANS 437 
Plan 10.15 8 X 9 rectangular lattice 
Block Rep. X Rep. Y 
xi 1239 4 5 6 7 8 ҮІ 9 17 25 33 41 49 57 65 
x2 10 1 12 13 14 15 16 Y2 1 18 26 34 42 50 58 66 
x3 17 18 19 20 21 22 23 24 ҮЗ 2 10 27 35 43 51 59 67 
ха 35 26 27 28 29 30 31 32 Y4 3 11 19 36 44 52 60 68 
X5 33 34 35 36 37 38 39 40 Y5 4 12 20 28 45 53 61 69 
X6 1 42 43 44 45 46 47 48 Y6 5 13 21 29 37 54 62 70 
x7 49 50 51 52 53 54 55 56 ҮТ 6 14 22 30 38 46 63 71 
xg 57 58 59 60 61 62 63 04 ҮЗ 7 15 23 31 39 47 55 72 
X9 65 66 67 68 69 70 71 72 Үз 8 16 24 32 40 48 56 64 

Rep. Z 

71 16 23 30 37 45 52 59 66 
72 2 17 32 39 46 54 61 68 
Z3 4 11 26 33 48 55 63 70 
24 6 13 20 35 42 49 64 72 
25 8 15 22 29 44 51 58 65 
Z6 1 9 24 31 38 53 60 67 
Z7 `3 10 18 25 40 47 62 69 
Z8 ^5 12 19 27 34 41 56 71 
Z9 7 14 21 28 36 43 50 57 
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Plan 10.16 
Block 


Х1 
Х2 
Хз 
ХА 
X5 
X6 
X7 
X8 
х9 
X10 


21 
22 
28 
24 
25 
26 
27 
28 
279 
210 


Вер. Х 
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9 X 10 rectangular lattice 


123456 


10 


11 


12 


13 


14 15 


19 


20 


23 24 


32 33 


41 42 


50 51 
59 60 


68 69 
77 78 


86 87 


Rep. Z 


3171318118185 


41 


51 61 
52 62 


53 63 
54 55 
46 56 
45 57 
37 47 


38 48 
39 49 
40 50 


КЕЕ ЕЕ ЕЕ 


Ү1 
Ү2 
ҮЗ 
Y4 
Y5 
Y6 
ҮТ 
Y8 
Y9 
Y10 


- 
© 
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CHAPTER. LÀ 


Balanced and Partially Balanced 
Incomplete Block Designs 


11.1 Balanced Incomplete Blocks 


The balanced lattices described in the last chapter are а particular 
group of a general class of designs known as balanced incomplete blocks. 
AII these designs have the property that any pair of treatments appears 
together equally often within some block. Thus in plan 11.1 every pair 
of treatments appears together once in the same block, in plan 11.4 twice, 
and in plan 11.5 four times. This property insures that the same stand- 
ard error may be used for comparing every pair of treatments; it also 
facilitates the statistical analysis, since any treatment total is adjusted 
in a single operation for all the blocks in which the treatment appears. 

The construction of the designs presents interesting mathematical 
problems, (11.1), (11.2), (11.3). Although a design can be found for any 
number of treatments, t, and any size of block, k, most of these are of no 
interest for our purpose, since they require too many replications. The 
plans in this chapter have been restricted to those in which the number 
of replicates does not exceed 10. 

Тһе balanced lattices are the particular set of designs for which the 
relation 2 = k? holds. They also have the property that the blocks can 
be grouped in separate replications. The majority of the balanced in- 
complete block arrangements do not possess this property, which is pos- 
sible only when ¢ is a multiple of К. ‘A few designs can be sorted into 
groups which comprize two or more replicates each. 

In most plans, the block contains six or fewer units. Accordingly, the 
designs are adapted for experiments in which the appropriate size of 
block is small. They have been applied in greenhouse pot experiments, 
where the block is restricted to the width of the bench; in experiments 
on plant virus diseases, where the block consists of a small number of 
leaves on each plant; in experimental cookery where there is a limited 
number of stoves, in tests of mosquito repellents, reference (11.4), where 
the block consists of the two exposed arms of a subject; and in 
nutritional experiments where each child constitutes a block, 3 different 
foods being given during the 3 terms of the school year. 

439 


440 BALANCED INCOMPLETE BLOCK DESIGNS 111 


Examples from field experimental work occur in blueberry fertilizer 
trials where, owing to the uneven growth of different bushes, only a small 
number of them can be grouped into a homogeneous block. A similar 
consideration may limit the size of block in experiments on the control 
of fruit pests, where the tree constitutes the experimental unit. 

A few types of factorial design—the 5 X 2, 5 X 3,7 X 3, and 7 X 4— 
can be arranged in balanced incomplete blocks with the result that main 
effects and interactions are confounded to the same extent. 


iiia Balanced Incomplete Blocks in Taste and Preference Testing. 
These designs have found fruitful applications in experiments in which 
individuals are asked to make a comparative rating of different objects 
that are presented to them. Examples are the rating of different, ways 
of preparing a food as to palatability, different colors in which some 
article is made as to acceptability, and different occupations as to their 
Social status. In most of these experiments the individual finds it in- 
creasingly difficult to reach a satisfactory rating as the number of ob- 
jects presented to him becomes larger. The optimum number of items 
that can be rated varies with the type of experiment, but it is often 
less than the total number that are to be compared in the experiment. 

One solution to this difficulty is to arrange the objects in a balanced 
incomplete block design. For instance, if there are 10 objects to be 
rated and it has been found that an individual should not be asked to 
compare more than 5 items at a time, the design for 10 treatments in 
blocks of size 5 (plan 11.17) is appropriate. Each individual consti- 
tutes an incomplete block. Sometimes the individual rates only one 
block, sometimes several blocks, with intervening rest periods. 

These applications have produced а number of interesting problems 
in statistical analysis. Bradley (11.82) gives an introductory review. 
The individual may merely rate the objects in order of preference 
(e.g., 1, 2, 3), so that the analysis must be based on ranked data. Dur- 
bin (11.92) develops the analogue of the F-test for ranked data, and 
Bradley and Terry (11.102, 11.112) create a mathematical model which 
relates the ranked data to a continuous scale of preferences. 

When the observer is asked to rate on a fixed scale, e.g., 1, 2, 3, +++ 10, 
it is often found that the rating assigned to an object depends on the 
other objects with which it is being compared in the incomplete block, 
being, for instance, lower if the other objects are “good” than if they 
are “poor.” This suggests the presence of a type of intra-block correla- 
tion that is not provided for in the standard analysis of variance of these 
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designs. Calvin (11.12a) has extended the mathematical model and 
the analysis so as to take account of this correlation. He shows that 
this analysis becomes simpler if the design has an additional property 
of symmetry—namely, that every trio of treatments appears equally 
often within the same block—and gives a catalogue of doubly balanced 
designs of this type. 


11.2 Comparisons with Other Designs 


Much experimental material is of the type exemplified above, in that 
the natural grouping for use as a block accommodates only а small num- 
ber of units. Where the number of treatments exceeds the number of 
units in a block, there are three alternatives to the use of a balanced 
incomplete block design. (1) A common treatment is placed in each 
block, so as to provide a basis for comparison between treatments which 
are in different blocks. (2) The treatments are divided into groups such 
that the number of treatments per group is equal to the number of units 
per block. "Тһе groups are compared іп a split-plot arrangement (chap- 
ter 7). (3) Complete replicates are assembled by the grouping of units 
from more than one block, in order that a randomized block design may 
be used. The merits of the first two alternatives were discussed in sec- 
tion 9.5. 

The relative efficiency of randomized blocks and balanced incomplete 
blocks has been discussed by Yates (11.5). Balanced incomplete block 
designs which are arranged in replications cannot be appreciably less 
accurate than randomized blocks. With balanced incomplete blocks 
which cannot be arranged in replications, a comparison with randomized 
blocks is not easy except by the use of uniformity data. Certain соп- 
clusions can be drawn from the nature of the experimental material. If 
there is some association among the members of different blocks, it may 
be possible to form replicates which are only slightly less homogeneous 
than the blocks. For example, suppose that in an animal experiment the 
litter is the natural block. If the responses of the animals are influenced 
by their genetic constitution and if replications are constructed from 
closely related litters, the experimental error may be only slightly greater 
within a replication than within a block. In this event randomized 
blocks may be the more accurate layout. 

Sometimes the investigator has no information about any associations 
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between the groups which constitute the blocks; he knows merely that 
the experimental responses are likely to differ from group to group. In 
these circumstances Yates (11.5) has shown that incomplete blocks (with 
the recovery of inter-block information) are more accurate on the ауег- 
age except when the blocks are ineffective. There is the additional ad- 
vantage that the block differences can be measured as a guide to the de- 
sign of future experiments, whereas with replications composed of units 
from several blocks it would be difficult to estimate the block effects. 

It is worth noting the efficiency factor (E) of any balanced incomplete 
block design whose use is under consideration. Thisfactor, which is shown 
as a decimal fraction in table 11.3 (p. 469) and in the plans, is a lower 
limit to the efficiency of balanced incomplete blocks relative to ran- 
domized blocks. With 37 treatments in blocks of 9 units, for instance, 
the efficiency factor is 91%, so that the loss of efficiency could not exceed 
9%. The loss equals 9% only in the rather unlikely situation in which 
the replications of 37 units are as uniform as the blocks of 9 units, yet the 
variation among blocks is large compared to that within blocks. With 
any reasonable prospect that. the blocks are more homogeneous than 
the replicates, the incomplete block design is to be preferred in this 
case. On the other hand, with 7 treatments in blocks of 2 units, the 
efficiency factor is only 58%. In this case incomplete blocks will be 
more accurate only if there is a substantial reduction in error variation 
because of the reduction in block size. 


11.3 Arrangement of Experimental Material 


Units within the same incomplete block should be as homogeneous as 
possible. For those designs which can also be grouped in replications, it 
is advisable to place similar blocks, so far as these can be discerned, 
within the same replicate. 

Certain of these designs can be rearranged to form an incomplete latin 
square (chapter 13) in which variation associated with two types of 
grouping can be eliminated from the experimental errors. The experi- 
menter should consider whether advantage can be taken of this double 
control so as to reduce the error. 


114 Randomization 
The steps are: 
1. Rearrange the blocks at random. (If the design is arranged in 
complete replications, the blocks are randomized only within each repli- 
cation and the replications are kept separate.) 
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2. Randomize the positions of the treatment numbers within each 
block. 
3. Assign treatments at random to the numbers in the plan. 


11.5 Statistical Analysis 


In order to present the shortest computational methods, it is ad- 
visable to divide the plans into five types, which are shown in the index 
at the end of this chapter. 


11.51 Type I. Designs Arranged in Replications. We will analyze 
an example with 6 treatments in blocks of 2 (plan 11.3). The experiment, 
was conducted by Dr. Pauline Paul at Iowa State College (11.6). Its 
object was to compare the effects of length of cold storage on the tender- 
ness and flavor of beef roasts. Six periods of storage (0, 1, 2, 4, 9, and 
18 days) were tested: these are denoted by treatment symbols 1, 2, 3, 
+++, 6, respectively. 

Thirty roasts from the round of an animal were used. Four muscles 
each provided 6 roasts, while 3 muscles each provided 2 roasts. The 
roasts on any muscle group themselves naturally into pairs, since to each 
roast on the left side of an animal there corresponds another roast on 
the right side. From previous experience it was believed that the 2 
roasts in any pair would give closely similar results. Variation among 
different pairs from the same muscle was expected to be somewhat larger, 
and variation among muscles to be still larger. 

These opinions prompted the-use of a design in blocks of 2, each block 
comprizing the left and right roasts in a pair. When grouping the blocks 
into replications, it was natural to put roasts from the same muscle into 
the same replicate. With the first 4 muscles a separate replicate could 
be made from each muscle. The remaining replication consisted of the 
3 smaller muscles. 

The plan and the scores for tenderness are given in table 11.1. Scoring 
was done by 4 judges, each marking on 8 scale from 0 to 10. The scores 
shown are their totals (out of 40), a high score indicating very tender 
beef. 

The following symbols are used: t = 6 = the number of treatments; 
k = 2 = number of units per block; r = 5 = number of replications; 
b = 15 = number of blocks. 

1. Find the block totals, B, the replicate totals, the grand total, G, 
and the treatment totals, Т, placed below the plan. 


BALANCED INCOMPLETE BLOCK DESIGNS 


TABLE 111 SCORES ror TENDERNESS OF BEEF 


Rep. I 
B 
(0 7 (217 24 
(326 (425 51 
(5) 33 (6) 29 62 
137 
Rep. IV 
B 
(0 25 (5) 40 65 
(2) 25 4 34 59 
(3) 34 (6) 32 66 
190 
Storage Treat- 
time — ment 
(days) no. T 
0 1 70 
1 2 115 
2 3 132 
4 4 139 
9 5 158 
18 6 155 
С = 769 


Rep. II 


(1) 17 (3) 27 
(2) 23 (5) 27 
(4) 29 (6) 30 


Rep. У 


(1) 11 (6) 27 
(2) 24 (3) 21 
(4) 26 (5) 32 


1538 0 


и 
19 
24 
17 
15 

-24 
—51 


0 


Вер. Ш 
В 
(1) 10 (4 25 35 
(2) 26 (6) 37 63 
(3) 24 (5) 26 50 
148 
Adj. 
T + ри means 
71.8 14.4 
117.3 23.5 
133.6 26.7 
140.4 28.1 
155.7 31.1 
150.2 30.0 
769.0 


2. For each treatment, calculate the following quantities: 


B, = total of all blocks in which the treatment appears 
Q=kT —B, —2T — B, 
= й4-ЮТ-(-108 + (k — 1)G = 4T — 8B, +6 


The total of the Q’s and the W’s should each be zero. 
3. The analysis of variance is as follows: 


Replications 


Treatments (unadj.) 


Blocks within reps. (adj.) 


Intra-block error 


Total 


d.f. 
6-1) 
4-1) 
(b — 7) 


(r—t—b--1) 10 
— 


(tr — 1) 


11.51 
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The only sum of squares requiring special instructions is that for blocks 
within replications, adjusted for treatment effects. Since we do not 
know any simple way of finding this directly, it is obtained by a round- 
about method. Calculate in the usual way the unadjusted sum of squares 
for blocks within replications: this comes to 753.0. Then compute the 
sum of squares for treatments, adjusted for blocks, which is 


¢-DDEF_ 5 
wk- (2)(6)(5)() 
The adjusted sum of squares for blocks is then given by the following 
combination of sums of squares 
Blocks (unadj.) + treatments (adj.) — treatments (unadj.) 
753.0 + 520.2 - 1059.8 = 213.4 
4. Calculate the weighting factor 
r(Ey — Е.) 
ЕСЕКТЕН ЕТЕНЕ 
5(21.34 — 7.73) 
= (80/2134) + 00073) — 
The adjusted treatment totals are 
T+ BW 
and the means per unit are found on division by r. 
5. Тһе effective error variance per unit is estimated as 
ЕЛ + (t — Bu] = 7.73801 + (4)(0.0948)] = 10.66 
We derive t-tests from this figure by the ordinary rules. For instance, 
the standard error of the difference between two adjusted totals is 
"V/2r(10.66), or 10.32; for the difference between two adjusted means the 


standard error is У (2) (10.66)/r, or 2.06. 

6. If an F-test is desired, caleulate the sum. of squares of the adjusted 
treatment totals. This figure, 4718.0, is divided by 7(¢ — 1), or in this 
case 25, in order to find the mean square on & unit basis, The mean 
square, 188.7, is compared with the effective error m.s., 10.66, which is 
assigned (tr — t — b + 1), or 10, d.f. The F-ratio, 188.7/10.66, is highly 
significant. The test is not exact, because it ignores the effects of errors 
in the weighting factor д. 

In an experiment of this type, interest centers on the shape of the 
response curve rather than on comparisons between pairs of treatments. 


- [(66)? + (11)? +-- -+ (22)2] = 520.2 
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The adjusted means indicate the common “diminishing returns” curve, | 
An exponential of the form у = A — Be~*, where t is the time of stor- 
age, fits the data satisfactorily. A parabolic regression on t, which is 
easier to compute, will be found to be inadequate. The conclusion ig 
that storage up to about a week increases tenderness. 

7. The gain in precision over randomized blocks is estimated аз fol- 
lows. By pooling the sums of squares for blocks within replications and 
intra-block error, we obtain an estimate of the error m.s. that would have 
applied with randomized blocks. This figure, 14.54, is compared with 
the effective error m.s., 10.66. The relative precision is estimated аз 
14.54/10.66, or 136%. Since the randomized block design has 20 
error d.f. as against 10 for the design actually used, this estimate should 
be reduced to 126% by the adjustment described in section 2.31. 


11.52 Type П. Designs Arranged in Groups of Replications. In our 
first edition, only 2 plans (11.6 and 11.13) were of this type. Since then, 
Rupp (11.7a) has shown that 8 additional plans can be arranged in 
groups of replications. The analysis is similar to that in the previous 
section. Two changes should be noted. If c is the number of groups, 
the d.f. in the analysis of variance subdivide as shown below. 


d.f. тв. 
Groups (c — 1) 
Treatments (unadj.) 46-1) 
Blocks within groups (adj.) (b — с) Еь 
Intra-block error ("-і-%5--1) Е, 
Total (tr — 1) 


"The second change is that the weighting factor и becomes 
"o (b — o — E) 
t(k — 1) — QE, + (t — k) — c — t + DE, 
11.53 Type Ш. Designs Not Arranged in Replications or Groups of 
Replications. Here again the method of section 11.51 may be followed 


except that no replication effects appear. The analysis of variance now 
reads as follows. 


d.f. m.s. 
Treatments (unadj.) (t — 1) 
Blocks (adj.) (b — 1) Es 
Intra-block error ("-і-%-1) Ee 


Total (tr — 1) 
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Аз before, the sum of squares for blocks (adjusted) is found by a sub- 
traction process. The weighting factor becomes 
"s (b — 1)(Еь — E) 
t(k — 1)(b — ПЕ + @— k)(b — E. 


Otherwise the method of section 11.51 applies. Of course, the gain in 
accuracy relative to randomized blocks cannot be estimated. A worked 
example with # = 9, k = 4, has been given by Yates (11.5, 11.7). His 
method of calculating the adjusted sum of squares for blocks is slightly 
different from ours, since he divides this sum of squares into two com- 
ponents so that the calculation becomes partially self-checking. Results 
should be identical by either method. 


11.54 Type IV. Experiments with £ = b. These designs, which can- 
not be arranged in replications, are covered by the computing instruc- 
tions given in the previous section. However, the analysis simplifies 
slightly when £ = b, because the sum of squares for blocks (adjusted) 
can be found directly. The data in table 11.2 are from an experiment on 
corn hybrids with t = b = 13, r =k = 4. For field experiments a de- 
sign that can be laid out in separate replications is usually preferred. 
In this experiment, conducted in 1943 by the North Carolina Agricultural 
Experiment Station, it was desired to compare 10 hybrids, all potentially 
suitable for commercial use, at a number of places throughout the state, 
and at each place to test in addition 3 standard varieties that were 
adapted to that place. With 13 treatments no design that ean be put into 
replieations is available, and since 4 was a convenient number of replica- 
tions, the design with Ё = 13, т = k = 4 was chosen. Randomized 
blocks might have been equally good. Incidentally, the plan used 
is different from that given in plan 11.22, a different method of con- 
struction having been employed. Both plans have the same structural 
properties. 

1. Find the block totals, B, the treatment totals, T, and for each 
treatment find the total B, of all blocks in which the treatment appears. 
Time is saved if T and B, are found simultaneously. The B; values should 
add to k times the grand total G. 

2. For each treatment find 


W-(—ET-(-1B + (Е — 06 
= 9T — 12B, + 3G (іп this example) 
The W’s should add to zero. 
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TABLE 11.2 PLAN AND YIELDS OF CORN (POUNDS PER PLOT) 
Block 
Block totals 
1 (3 25.3 (6) 19.9 (9) 29.0 (11) 24.6 98.8 
2 (3) 23.0 (4) 19.8 (8) 33.3 (12) 22.7 98.8 
3 (10) 16.2 (11) 19.3 (12) 31.7 (13) 26.6 93.8 
4 (2) 27.3 (5) 27.0 (8) 35.6 (11) 17.4 107.3 
5 (7) 23.4 (8) 30.5 (9) 30.8 (10) 32.4 117.1 
6 (4) 30.6 (5) 32.4 (6) 27.2 (10) 32.8 123.0 
7 а) 34.7 (5) 31.1 (9) 25.7 (12) 30.5 122.0 
8 (3) 34.4 (5) 32.4 (7) 33.3 (13) 36.9 137.0 
9 (1) 38.2 (2) 32.9 (3) 37.3 (10) 31.3 139.7 
10 (2) 28.7 (4) 30.7 (9) 26.9 (13) 35.3 121.6 
1 (1) 86.6 (4) 31.1 (7) 31.1 (11) 28.4 127.2 
12 (1) 31.8 (6) 33.7 (8) 27.5 (13) 41.1 134.4 
13 (2) 30.3 (6) 31.5 (7) 39.8 (12) 26.7 127.8 
1548.5 
№_. T В, Ww Т+ Т 
1 141.3 523.3 —362.4 136.7 
2 119.2 496.4 —238.5 116.2 
Б] 120.0 474.3 33.9 120.4 
4 112.2 470.6 8.1 112.3 
5 122.9 489.3 —120.0 121.4 
6 112.3 484.0 —151.8 110.4 
7 127.1 509.1 —319.8 123.0 
8 127.2 457.6 299.1 131.0 
9 112.4 459.5 143.1 114.2 
10 112.7 473.6 — 23.4 112.4 
п 89.7 427.1 327.6 93.9 
12 111.6 442.4 341.1 115.9 
13 139.9 486.8 63.0 140.7 
1548.5 6194.0 0 1548.5 
3. The analysis of variance is: 
d.f. 8.8. m.s. 
Treatments (unadj.) (t—1) 12 542.67 
Blocks (adj.) (b — 1) 12 475.27 39.615 
Intra-block error (tr —2t+1) 27 538.21 19.93E, 
Total (tr — 1) 51 1556.15 


The sum of squares for blocks 


(adjusted) is found directly as the sum 


of squares of the W’s, divided by tr(t — k)(k — 1), or in this case 


(4)(13)(9)(8) = 1404. 
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4, The weighting factor и simplifies to 
Ey — Е, 19.68 
u= = 
i(k — ПЕь (13)(3)(39.61) 


The adjusted treatment totals are the quantities (Т + и’). 
5. As before, the effective error m.s. per unit is taken as 


ЕДІ + (t — ku] = 19.93[1 + (9)(0.0127)] = 22.2 
Other features of the analysis are the same as in previous sections. Yates 
(11.5) presents an example with # = 21, = 5. 


11.55 Type V. Small Experiments. In certain of the smaller de- 
signs, the numbers of degrees of freedom in the mean squares for blocks 
and intra-block error are rather small. Consequently, the estimates of 
the relative weights assigned to inter- and intra-block comparisons are 
poor. In these cases, unless the whole plan has been repeated to secure 
extra replication, it is best to use the original method of analysis that 
Yates developed (11.8). This method adjusts the treatment means for 
differences between the blocks, but makes no use of inter-block informa- 
tion. The analysis is a little simpler than when inter-block information 
is used. 

For purposes of illustration, this analysis will be applied to the ex- 
ample in the previous section, with 4 = b = 13, т = k = 4, though in 
practice inter-block information should be utilized in an experiment of 
this size. The data are presented in table 11.2. 

1. Find the block totals, В, and the treatment totals, Т. For each 
treatment find the total В, over all blocks in which the treatment appears. 

2. For each treatment compute 

0-ЕТ-В,-4Т-В, 
r 4-1 T 12 
"o 1) Г OAC) 
where т = 1548.5/52 = 29.78 is the mean of the whole experiment. 
The / values are the adjusted treatment means. The Q values should 
add to zero. 
3. The analysis of variance is shown below. 


= 0.0127 


Q = т + 0.07690 


d.f. 8.8. m.s. 
Blocks (unadj.) (b —1) 12 689.38 
Treatments (adj.) (t— 1) 12 328.55 27.38 
Intra-block error (r—t—b+1) 27 538.22 19.93Е, 


Total (tr —1) 51 1556.15 


Note that the blocks sum of squares is unadjusted. It is found in the 
usual way as the sum of squares of deviations of the B’s, divided by k. 
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This rule applies even when the experiment is arranged in replications 
or groups of replications, though if the experimenter wishes, he may di- 
vide the blocks sum of squares into that due to replications (or groups) 
and that due to blocks within replications. 

The adjusted sum of squares for treatments is 


4-І ah 12 А И 1 
"o mo pede 7 Gase [(41.9)? + (19.6)? +- - -+ (72.8)?] 
= 328.55 


As usual, the intra-block error s.s. is found by subtraction. 

4. For t-tests, the effective error variance is 

Е, [i " t—k а-ы LES 1) Е (19.93) (12) (4) 
t(k — 1) i(k — 1) (13) (8) 

For the variance of the difference between two adjusted treatment 
means, we multiply this quantity by 2/r. 

5. The analysis above provides an exact F-test of the adjusted treat- 
ment means. The F-ratio is 27.38/19.93, with 12 and 27 d.f. 


11.56 Missing Data. For reasons given in section 10.13, it is recom- 
mended that the analysis of incomplete data be carried out by inserting 
for the missing observations values which minimize the intra-block 
error s.s. In our notation the formula, first given by Cornish (11.9), is 
ir(k — 1)B + k(t — 1)Q — (t — 00 
(k — 1)[tr(k — 1) — kt — 1)] 
where B is the total of the block containing the missing observation, Q is 
the Q value (= kT — В) for the treatment that contains the missing 
value, and Q' is the sum of the Q values for all treatments that are in the 
block with the missing value. 

Example 1. In the experiment on roast beef in section 11.51, suppose 
that treatment (5) in replication IV is absent. The first step is to find 
the block totals, the treatment totals, the В, and the Q values as in the 
ordinary analysis. An z should be inserted for the missing observation. 
The data read as follows. 


= 24.5 


"Treatment 
number Th В, 0-%Т-В, 
1 70 166 +2 —26—х 
2 115 241 —11 
3 132 256 8 
4 139 262 16 
5 118 += 245 += — 942 
6 155 288 22 


Totals 729 + = 1458 + 2r 9 
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Since the block with the missing value contains treatments (1) and (5) 
0 = —26 — 9 = —35 
Hence, since B is 25, 
_ 30B + 109 — 50' 6B + 20 — Q' 


т = 
20 4 


e (6)(25) + (2)(—9) — (—35) in 
4 


On substituting this value in all places where an z occurs in the table 
above, we are ready to proceed with the analysis of variance. 

Example 2. For an example with 2 missing observations, suppose 
that treatments (8) and (13) are missing from block 12 in the corn ex- 
periment in section 11.54. Denote the values to be inserted by x and y, 
respectively. To save space, the preliminary data are presented only 
for the treatments that occur in the same block as the missing observa- 
tions; these are treatments (1), (6), (8), and (13). 


42 


"Treatment 
number Т В, Q = 4T — B: 
1 141.3 454.4 += +у 110.8 -2-у 
6 112.3 415.1 += +y 841-:-у 
8 99.4 + х 388.7 +2 +y 8.9 + 35 — y 
13 98.8 +y 417.9 +2 у —22.7 - 2 + 3y 


131.1 = Q’ 
В = 65.5 += фу 


In the 3 replications in which it appears, the average for treatment (13) 
is 33: this will be chosen as the first approximation to y. Then for z we 
have 


B = 65.5 + 33.0 = 98.5; Q = 8.9 — 33 = —24.1; Q = 131.1 
Since = 13, k =r = 4, 
- 156B + 48Q — 120' Е 13В +40 — Q' 
ЖЫ 324 14y Jo [i87 
—24.1) — (131.1 
_ (13) 98.5) + Oe 24.1) — 0310) _ до 


We now substitute = 39.0 in the preliminary data and obtain a 
second approximation to y. For this we have 


B = 65.5 + 39.0 = 104.5; = —22.7 — 39.0 = —61.7 
О’ = 131.1 
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Note that Q’, which does not involve z or y, remains fixed throughout the 
calculation. The second approximation to y is 
(13)(104.5) + (4)(—61.7) — (131.1) 
у= = 
27 


36.3 


The next 2 approximations will be found to be т = 40.1, у = 36.7, 
which may be used as the values to be inserted. 

The rest of the analysis proceeds as usual; the intra-block error de- 
grees of freedom are reduced by 1 for each missing observation. In the 
calculation of standard errors for t-tests of the adjusted treatment 
means, the method given in section 10.13 should be followed. 


11.57a Statistical Analysis for Repetitions of the Designs. With some 
of the smaller plans, the whole plan may be repeated p times when extra 
replication is wanted. In order to continue to use as many of the for- 
mulae already given as possible, it is advisable to let r denote the total 
number of replications and b the total number of blocks. If the basic 
design as given in the plans has r’ replications and b’ blocks, then r = 
pr’ and b = pb’. 

Type I. Designs arranged in replications. With these definitions of r 
and b, all the steps and formulae given in section 11.51 remain the same. 
As before the weighting factor is 


т(Е, Em Е.) 


PU Ж — NE, + kb —r—1t4 DE, 

Formulae will not be presented for the designs arranged in groups of 
replications, Type 11, since these designs will seldom be duplicated. 

Type III. Designs not arranged in replications or groups of replications. 
It is assumed that the repetitions will be kept apart during the course 
of the experiment. The (b — 1) d.f. between blocks then subdivide into 
(p — 1) d.f. between repetitions and (Б-р) d.f. between blocks within 
repetitions. In the analysis of variance, the d.f. are allocated as follows: 


df. 
Repetitions (р—1) 
Treatments 4-1) 
Blocks within repetitions (5- р) 


Intra-block error (tr —t-—b+1) 
Total (tr — 1) 
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The analysis follows the pattern described in section 11.53. Calculate 
the total sum of squares and the sums of squares for repetitions, treat- 
ments (unadjusted), and blocks within repetitions (unadjusted) in the 
usual way. The sum of squares for treatments (adjusted) is 


(210570 
kir(k — 1) 
as given in section 11.51. The sum of squares for blocks within repeti- 
tions (adjusted) is obtained by the indirect route as 
Blocks within repetitions (unadj.) 
+ treatments (adj.) — treatments (unadj.) 
The weighting factor becomes 
3s (b — р)(Еь — Е.) 
ilk — 1)(0 — р)Еь+ (t — kK) —4— p + DE. 
where Е, is the mean square for blocks within repetitions (adjusted). 
No further changes are needed. 
Туре ТУ. Experiments with t = b. It is suggested that the same 
method as for Type III be used in these experiments. 
Type V. Small experiments. The procedure described in section 11.55 
is still valid, remembering that r and b are the total numbers of replica- 
tions and blocks, respectively. 


11.6а Partially Balanced Incomplete Block Designs 


11.61а Description. As mentioned previously, Bose and Nair (11.1a) 
showed that an incomplete block design which lacks full balance must 
possess certain properties of symmetry if the statistical analysis is not 
to be too cumbersome. These properties will be illustrated by the 3 X 3 
triple lattice, which is а partially balanced design in the sense of Bose 
and Nair. 


TABLE 11.14 А 3 ХЗ TRIPLE LATTICE 


Block Rep. I Rep. II Rep. III 
() 123 (0147 (0159 
(2) 456 (5) 258 (8) 7 2 6 
@ 789 (0 369 (9) 48 3 


First, there are some properties that hold in all the designs which we 
have considered: 
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(1) Every block contains k units. Every treatment occurs г times and 
no treatment appears more than once in a block. 


Consider treatment 1. The other treatments that appear in the same 
block as treatment 1 are treatments 2 and 3 (block (1)), treatments 4 
and 7 (block (4)), and treatments 5 and 9 (block (7)). There are two 
other treatments (6 and 8) which do not appear in the same block as 
treatment 1. Similarly, if we start with any other treatment, we find 
six treatments that are in the same block with it, and two that are not. 
Pairs of treatments that occur in the same block are called first asso- 
ciates; pairs that do not are called second associates, and the design is 
said to have two associate classes. 

This requirement may be put in general terms as follows (for a design 
with two associate classes): 


(2) Every pair of treatments occurs together in either Ау or X; blocks. 
Pairs that occur together in à; blocks are called first associates; pairs 
that occur together in Ар blocks are called second associates. 


For the triple lattice, 4; = 1 and Аҙ = 0, as we have seen. Іп gen- 
eral, ^; and А» can be any two whole numbers that are different. 

We must now dig a little deeper. Take a pair of treatments that are 
first associates, say 1 and 2, and consider the relationships of the other 
treatments to these two. Treatments 3, 4, 5, 7, and 9 are in the same 
block as 1, while 6 and 8 are not. For treatment 2, the other treatments 
that are in the same block are 3, 5, 6, 7, and 8, while 4 and 9 do not ap- 
pear in the same block. These relationships may be presented in a 
2 X 2 table. 


TABLE 11.28 RELATIONSHIPS OF THE OTHER TREATMENTS TO 1 AND 2 


Relation to 1 
lst associate | 2nd associate 


186 associate 3,5,7 6, 8 


Relation 
to 2 


2nd associate 4,9 None 


Treatments 3, 5, and 7 are first associates of both 1 and 2; treatments 
4 and 9 occur in the same block with 1 but not with 2, and hence are 
first associates of 1 and second associates of 2, and so on. 
The numbers of other treatments falling into the four cells in table 
11.28 are, respectively, 
8%2 
20 
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The third basic property of a partially balanced design is that these four 
numbers must remain the same, no matter which pair of first associates 
we start with. The reader may verify, for instance, that the same set 
of four numbers is obtained when starting with treatments 3 and 9. 

If this process of classification is repeated, starting with a pair of 
treatments like 1 and 6 that are second associates, we find a different set 
of four numbers, namely 

6 0 

01 
Once again, however, these same four numbers will emerge for any pair 
of second associates with which we start. In technical language, this 
property is expressed as follows: 


(3) Given any two treatments that are ith associates, the number of 
treatments that are jth associates of the first treatment and kth asso- 
ciates of the second treatment is the same no matter which pair of ith 
associates we start with. This number is denoted by the symbol pj". 
Further, рд! = pij". 


Any design satisfying properties (1), (2), and (3) above is a partially 
balanced incomplete block (pbib) design with two associate classes. 
The most intricate property is, of course, (3). This property makes the 
least squares analysis of the results tractable, and guarantees that only 
two standard errors are required for testing differences between pairs 
of treatments. In practice, these standard errors are often so close to- 
gether that a single error suffices. 

The user of pbib designs need not master these properties. They do, 
however, indicate the difficulties that may arise when “home-made” 
designs are used without an investigation of the relationships among 
the treatments and blocks. The rectangular lattices (section 10.3) are 
not partially balanced in this sense, with the result that 4 different 
standard errors are necessary in the simple rectangular lattice and 7 in 
the triple rectangular lattice, although usually we can dispense with 
most of these in practice. 


11.62a Index to Partially Balanced Incomplete Block Designs. Al- 
though the number of designs is too great to enable us to present the 
individual plans, table 9.2a (p. 392) shows the number of treatments, 
size of block (К), and number of replications (г) for each plan that is 
available, and contains a reference to the source of the plan. This table 
deals with designs in blocks of sizes from 3 to 10. For designs for which 
k = 2 (two treatments per block) see reference (11.3a). 
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In the following sections we present an example of the statistical 
analysis of a pbib design, in order that the reader may appraise the de- 
gree of complexity. The intra-block analysis is little more involved 
than that for a fully balanced design. The analysis with recovery of 
inter-block information takes considerably longer. Thus the designs 
are especially suitable for experimental situations in which the group- 
ing into blocks is highly effective, so that inter-block information is un- 
important and may be neglected. 


11.63а Example of the Intra-block Analysis. This experiment has 15 
treatments arranged in blocks of size 4, with 4 replications. The data 
are the pounds of seed cotton per plot in a uniformity trial on which 
dummy treatments have been superimposed. The plan and yields are 
shown in table 11.3a. 


TABLE 11.3a PLAN AND YIELDS OF COTTON (POUNDS PER PLOT) 


Block 
Block totals 
1 (15) 2.4 (9) 2.5 (1) 2.6 (13) 2.0 9.5 
2 (5) 2.7 (7) 2.8 (8 2.4 (1) 2.7 10.6 
3 (10) 2.6 (1) 2.8 (14) 2.4 (2) 2.2 10.0 
4 (15) 3.4 (1) 3.1 (2) 2.1 (3) 2.3 10.9 
5 (6) 4.1 (15) 3.3 (4) 3.3 (7) 2.9 13.6 
6 (12) 3.4 (4) 3.2 (3) 2.8 (1) 3.0 12.4 
7 (12) 3.2 (14) 2.5 (15) 2.4 (8) 2.6 10.7 
8 (6) 2.3 (3) 2.3 (14) 2.4 (5) 27 9.7 
9 (5) 2.8 (4) 2.8 (2) 2.6 (13) 2.5 10.7 
10 (10) 2.5 (12) 2.7 (13) 2.8 (6) 2.6 10.6, 
11 (9) 2.6 (7) 2.6 (10) 2.3 (3) 2.4 9.9 
12 (8) 2.7 (6) 2.7 (2) 2.5 (9) 2.6 10.5 
13 (5 3.0 (9) 3.6 (11) 3.2 (12) 3.2 13.0 
14 (7) 3.0 (13) 2.8 (14) 2.4 (11) 2.5 10.7 
15 (10) 2.4 (4) 2.5 (8 3.2 (11) 3.1 11.2 


С = 164.0 


This design is of the type known as “group divisible.” The key to 
the design is the following association scheme among the treatment 
numbers. 


а (65611 
Эй: 212 
8 8 13 
4 9 14 
5 10 15 
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Every treatment appears once in a block with all other treatments ex- 
cept the two other treatments in the same row of the association scheme. 
For instance, every treatment except 6 and 11 occurs somewhere in the 
same block as treatment 1. If we regard treatments in the same row as 
first associates, we have Ау = 0, and № = 1. 

This example is also worked in the monograph (11.2a) from which 
the design was taken. The analysis given here follows the same method, 
but utilizes totals instead of means until the final step.* In this design, 


t=15; k=4; r=4; 0 = 15 
1. Find the block totals В and the grand total G and insert them in 


the plan (table 11.3a). 
2. For each treatment, find the treatment total Т and the total В; of 
all blocks in which the treatment occurs. For treatment 1, 


Т =2.6+ 27+ 28+ 30=111 
B, = 9.5 + 10.6 + 10.0 + 12.4 = 42.5 


These quantities can be found simultaneously on a desk computing 
machine by placing the treatment yields on the left of the keyboard and 
the corresponding block totals on the right. Asa check, the sum of the 
T's is G and the sum of ће В/в is kG. The T's and Bys are written in 
the first two columns of the working table 11.4a. 


TABLE 11.48 (COMPUTATIONS FOR THE INTRA-BLOCK ANALYSIS OF A РЫБ DESIGN 


(1) (2) (3) (4) (5) (6) 
Treat- Q" = ЕТ - Bı Adjusted 
ment T Bi =4T-B, 6" 1804 mean 
1 11.1 42.5 1.9 6.1 22.4 2.86 
2 9.4 42.1 —4.5 -0.8 —66.7 2.36 
3 9.8 42.9 -3.7 -4.2 —51.3 2.45 
4 11.8 47.9 -0.7 -0.7 -9.8 2.68 
5 11.2 44.0 0.8 -0.4 12.4 2.80 
6 11.7 44.4 2.4 6.1 29.9 2.90 
7 11.3 44.8 0.4 -0.8 6.8 2.77 
8 10.9 43.0 0.6 —4.2 13.2 2.81 
9 11.3 42.9 2.3 -0.7 35.2 2.93 
10 9.8 41.7 -2.5 -0.4 —37.1 2.53 
n 11.9 45.8 1.8 6.1 20.9 2.85 
12 12.5 46.7 3.3 —0.8 50.3 3.01 
13 10.1 41.5 —1.1 —4.2 —12.3 2.66 
14 9.7 41.1 —2.3 -0.7 —33.8 2.55 
15 11.5 44.7 1.3 -0.4 19.9 2.84 
Totals 164.0 656.0 0.0 0.0 0.0 


* Our results differ slightly owing to ап error іп the total sum of squares in (11.2a) 
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3. From these two columns, form a third column of the values 
Q” = kT — В, = 4T - В, 


The Q” values should sum exactly to zero. (In previous designs we 
have used Q for this quantity instead of Q”; however, the monograph 
(11.2a) uses Q for our Q"/k, and the change in our notation has been 
made to avoid confusion.) 

4. We are now ready to compute the treatment means f, adjusted for 
block effects. The formula is 


rk(k — 1)Ё = (k — e3)Q" + (e — сә)81(0”) (1) 


Tn this formula the quantities c; and со are functions of the structural 
constants of the design, i.e., of the \’s and the ру! of section 11.618, 
The numerical values of сү and с are given with the plan for each de- 
sign contained in references (11.2a) and (11.3a). For this design 


&70; о= т 


The quantity 51 (@”) denotes the sum of the Q”, taken over the first 
associates of the treatment in question. Thus in finding the adjusted 
mean of treatment 1 we add the Q" values for treatments 6 and 11, 
which are the first associates of treatment 1. 

Formula (1) for # holds for any pbib design with two associate classes. 
At this point, however, it is usually possible to shorten the calculation 
slightly by rewriting the formula, the particular method of rewriting 
depending on the type of design. For this design, let 


G" — Sum of Q" over all treatments in the same row in the 
association scheme 


For treatment 1, б” is the sum of the Q” over treatments 1, 6, and п. 
Then it follows that 


G" = S,(Q") + 9" 
Substituting for S,(Q") in terms of G” in the equation for $, we have 
rh(k — 1)Ё = (k — сз — e + c2)Q" + (су — с2) а" 
= (k — с1)0"' + (e — с) 4" (2) 


The advantage of introducing the G” is that there are only five values 
of G” to be computed, one for each row in the association scheme. These 
values are entered in column (4) of table 11.4a. For treatments 1, 6, 
and 11, 

G” = 1.9 +24 +18 = +6.1 
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5. Finally, substituting numerical values for the constants in equa- 


tion (2) for f, с 
(4)(4@)@)Ё = 40” – 156” 
This equation can be written as 
1804 = 15Q" — G” 
The values of 180 appear in column (5) of table 11.4a. They should 
add to zero. To obtain the adjusted treatment means, multiply the 


values in column (5) by 0.005556, the reciprocal of 180, and add the 
general mean, 164.0/60, or 2.733. For treatment 1 this gives 


(0.005556) (22.4) + 2.733 = 2.86 


6. In the analysis of variance, the total sum of squares and the sum 
of squares for blocks are found in the usual way. The general formula 
for the adjusted treatments sum of squares is 


1 mask 2 
15-150 


where the sum is taken over all treatments. This may be written 


1 a Sum of products of columns (5) and (3) 
— 18009" = — 77 
720 > ( dd 720 


TABLE 11.5а INTRA-BLOCK ANALYSIS OF VARIANCE 


d.f. 8.8. тв. 
Blocks (unadj.) b—1 14 4.9233 
Treatments (adj.) 4-1 14 1.564] 0.1117 
Error tw —t—b+1 31 2.6859 0.0866 = E, 
Total “-і 59 9.1733 


For two treatments that are first associates (like 1 and 6), the error 
variance of the difference between their adjusted means is 
2E. (k — с) 8 2(0.0866) (4 — 0) 
т (Е-Ә) 4x l) 
= 0.0577 
For second associates, the error variance of the difference between the 
adjusted means is 
2E, (k — а) _ 2(0.0866) (4 — 15) 
т<" aaie) 
= 0.0539 
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The two variances are so close together that an average variance can 
safely be used for all comparisons between pairs. Since any treatment 
has 2 first associates and 12 second associates, an appropriate average ig 


2(0.0577) + 12(0.0539) 

14 
The standard error, 4/0.0544 = 0.233, with 31 d.f., is used for ¢-tests 
among the adjusted treatment means. In this example, which has 


dummy treatments, t-tests are of no interest. From table 11.5a, the 
F-ratio is 0.1117/0.0866 = 1.29, with 14 and 31 d.f. 


= 0.0544 


11.64а Analysis with Recovery of Inter-block Information. All the 
computations already made in section 11.63a are needed, except the 
adjusted treatment means in column (6) of table 11.4a. We continue 
as follows: 

7. Find the unadjusted treatments sum of squares in the usual way. 
This comes to 3.1383. From this value and the analysis of variance in 
table 11.5a we obtain the adjusted blocks sum of squares by the relation 


Blocks (adj.) = blocks (unadj.) + treatments (adj.) 
— treatments (unadj.) 
= 4,9233 + 1.5641 — 3.1383 = 3.3491 
The mean square for blocks (adjusted) is 
Еь = pe = 0.2392 
14 
The intra-block error mean square (from table 11.58) is 
E, — 0.0866 


If E, is less than or equal to E., proceed no further and use the unad- 
justed treatment means as the final estimates. 
8. Various weighting coefficients are now computed. 


T 
и = — = 11.55 
оу МЇ абаз ЕП) Meaney ws эй; AB cd 
КФ — ЦЕ,-(4- ЮЕ. (56)(0.2392) — (11)(0.0866) 
= 3.62 
А 
ONE Ed Lcd Ls 
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We also require coefficients dı and dy which take the place of the c; and 
сә in the intra-block analysis. These d's depend on W, on the св and 
on other structural constants А, H, ^; and М. For this design 


a =0; оз; A-2í НЕ, X20; Hl 


4, ы SE зщ since c; and ^, are b 
АЕН ЕР? 5 1 1 are both zero 
"m AtW _ (15) 5) + 4(0.46) 
А--тНИ + 2? (45) + 4(27) (0.46) + 16(0.46)? 
4.84 
= 27.06 = 0.1789 


9. To replace the 0” we compute values P". 
” 
Р" = ив, + uq — 75 
where G is the grand total, 164.0. Numerically, 
P" = 3.62В, + 11.550” — 158.315 


The P" are shown in column (1) of table 11.6a. For treatment 1, 


TABLE 11.08 ADDITIONAL COMPUTATIONS FOR RECOVERY OF INTER-BLOCK 


INFORMATION 
(1) (2) (3) 

Treat- Adjusted 
ment p", а” mean 
1 17.480 75.884 2.82 
2 —57.888 —0.553 2.37 
3 —45.752 —62.207 2.47 
4 6.998 —5.552 2.78 
5 10.205 —7,517 2.80 
6 30.133 75,884 2.90 
7 8.481 -0,553 2.79 
8 4.275 —62,267 2.78 
9 23.548 —5.552 2.88 
10 —36.236 —7.517 2.51 
11 28.271 75.884 2.89 
12 48.854 -0,553 3.04 
13 —20.790 —62.207 2,62 
14 —36.098 -5.552 2.51 
15 18.514 -7.517 2.85 


| 5 
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P" = (8,62) (42.5) + (11.55)(1.9) — 158.315 
= 17.480 


The P” add to zero, apart from rounding errors. 
10. The adjusted treatment means # are given by the equation 


ігіш + w(k — 1)}# = (Е — d3)P" + (а, — ауа (Р”) 


where S; denotes summation over the first associates of the treatment in 
question. This is the general formula for all pbib designs with two asso- 
ciate classes. 

For this design the formula can be rewritten, as in step 4 of the intra- 
block analysis, 


гіш + wk — 1)}Ё = (k — ФР" + (di — ауа" 


where the G’” for any treatment is the sum of the P" over the row in 
which the treatment appears in the association scheme at the beginning 
of section 11.63a. The С” values are placed in column (2) of table 
11.68. For treatment 1 


6” = 17.480 + 30.133 + 28.271 = 75.884 
Finally, the adjusted treatment means are 
(k — а) Р” + (dı — d2) @” 
kr(w' + w(k — 1)) kr(w' + w(k — 1} 


The constant M is the general mean and is added so that the adjusted 
means will average to the general mean of the experiment. Numerically 
we have 


M+ 


Adjusted mean = 2.733 + 0.00624P” — 0.00029G"" 


These values appear in column (3) of table 11.6a. 
11. The estimated variance of the difference between the adjusted 
means of two treatments that are first associates, like 1 and 6, is 


2(k — а) 
r(w' + w(k — 1)} 


For two treatments that are second associates, the corresponding for- 
mula is 


= 0.0523 


2(k — də) 
т{ш + w(k — 1)} 


As in the intra-block analysis, an average variance of the difference be 
tween two adjusted treatment means is computed as 


= 0.0499 
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2(0.0523) + 12(0.0499) 


14 = 0.0502 


The method of analysis, as illustrated by this example, is essentially 
the same for all pbib designs with two associate classes, except for slight 
simplifications that can be made at the point at which we introduced 
the G”. 

In some of the designs the blocks can be combined to form replications 
(as in the lattices) or groups of replications. No change is required in 
the intra-block analysis, although, if desired, the (b — 1) d.f. for blocks 
may be partitioned into (7' — 1) d.f. for replication groups and (b — 7”) 
d.f. for blocks within replication groups. For the recovery of inter-block 
information, this partition is necessary. The sum of squares for blocks 
within replication groups (adjusted) is computed as: 


blocks within rep. groups (unadj.) 
+ treatments (adj.) — treatments (unadj.) 

and has (b — r^ d.f. If E, is the corresponding mean square, then 
_ &b-r)-(-H5 
— kb — т)Е, — (t — BE, 
The remainder of the analysis proceeds as in the example. 

In a substantial number of the designs, replications can be formed at 
right angles to the blocks, permitting a two-way elimination of hetero- 


geneity as in the latin square. These designs are discussed in section 
13.48. 


11.7a Chain Block Designs 


11.71а Description. These designs were developed by Youden and 
Connor (11.4a) for experimental situations in which 

(i) The size of block is limited, and the number of treatments to be 
tested considerably exceeds this size, 

(ii) Within the blocks, comparisons are of such high precision that 
only one or two replications are needed. 

The designs have been used in precise experimentation in the physical 
sciences. They are unlike any of the other designs in this book in that 
some treatments are replicated only once while others are replicated 
twice. Although their utility is likely to be restricted by condition 
(ii) above, they are flexible and easy to construct, and adapt themselves 
to situations in which even a partially balanced design, if one exists, 
demands too many replications. 
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The example in table 11.7a, for 17 treatments in blocks of 6 or 5 
units, illustrates the principle of the design. 


TABLE 11.7a А CHAIN BLOCK DESIGN FOR 17 TREATMENTS IN BLOCKS OF 6 OR 5 


"The letters denote the treatments. Capital letters refer to treatments 
that are replicated twice; small letters to treatments that are replicated 
once. The suffixes indicate the blocks. 

Treatments C and D appear in both blocks 1 and 2, and form the link 
between these blocks. Treatments E and F form a similar link between 
blocks 2 and 3, treatments G and H between blocks 3 and 4, and so on 
until we reach treatments A and B, which complete the chain by linking 
blocks 5 and 1. This linking is the basic property of the design. Every 
block contains one group of treatments linking it to the preceding block 
in the chain, and one group linking it to the succeeding block. 

Now as to flexibility. The basic groups like (A, B) may contain 1, 2, 
3 or more letters, 1.е., have 1, 2, 3 or more treatments. This number 
should, however, be the same for all groups. Any number of additional 
treatments that are replicated once, like k, І, m, may be included, sub- 
ject only to the limitations imposed by the size of the block. In table 
11.7a, seven treatments are replicated once. Note that in the first two 
blocks, all 6 units are occupied by treatments, whereas in the last three 
blocks only 5 units are utilized. It is advisable to distribute the extra 
treatments so that block sizes are as equal as possible. 

In the selection of a design for a given number of treatments and units 
per block, the main question to decide is how many of the treatments 
are to be replicated twice. Then use the largest size of basic group that 
will give these treatments double replication and will fit into the blocks. 
Suppose that we have blocks of size 7, and that 9 treatments are to be 
replicated twice and 3 replicated once. The basic group cannot exceed 
3 treatments, since there must be two basic groups in each block. A 
moment's trial shows that а plan сап be constructed with basic groups 
of size 3, as shown in table 11.84. 
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TABLE 11.8a CHAIN BLOCK DESIGN FOR 12 TREATMENTS IN BLOCKS ОҒ 7 


Since the number of error d.f. is sometimes small, it is well to check 
this before adopting a design. This number is 


b(ng — 1) +1 


where b is the number of blocks and ng the number of treatments in a 
basic group. The designs in tables 11.7a and 11.8а have 6 and 7 error 
d.f., respectively. 

When treatments that are replicated once are present, the designs are 
not partially balanced in the sense of Bose and Nair, and several stand- 
ard errors must be computed if t-tests are wanted. This is the chief 
disadvantage of the designs. In allocating treatments to blocks, it is 
worth noting that the standard error of the difference between two treat- 
ment means is smallest for pairs of treatments in the same block. Next 
come pairs of treatments in blocks that are neighbors in the chain of 
blocks; next come pairs in blocks 2 units apart (by the shortest path), 
and so on. Of course, treatments replicated twice are more precisely 
determined than those replicated once. 


11.72a Statistical Analysis of Chain Block Designs. The statistical 
analysis is not difficult. Тһе easiest method is first to estimate the block 
effects. Each treatment mean is then adjusted for the effects of the block 
Or blocks in which it lies. In following the instructions it is helpful 
to think of the blocks as arranged in a cireular chain, each block having 
two neighbors in the chain. 

For each block, calculate the quantity 


D' — (Total of that part of the block containing treatments 
that are replicated twice) — (total of these same treat- 
ments in the neighboring blocks) 

Since it is important that this instruction be clearly grasped, two exam- 
ples are given. For block 1 in table 11.7a 
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D' = (А, + В, + C1 + Di) — (As + Bs + C2 + Р») 
For block 2 in table 11.8a 
D' = (Da + Ez + Fo + Go + He + 12) 
— (Di + Е, + Fy + Gs + Нз + I3) 


The quantities D’ add to zero over the blocks. 
The formula for the estimated block mean 6 varies with the number of 
blocks. Table 11.9a presents these formulae for experiments with from 


TABLE 11.2 FORMULAE FOR THE ESTIMATED BLOCK MEAN IN CHAIN BLOCK 
DESIGNS 
No. of blocks Estimated block mean б 
E] Dy'/3n; 
4 (4Do' + ру + 2-1) /8па 
5 (3D? + Dy’ + 2-1) /5та 
6 (8Dy' + 3Dy' + 3р_у — D3’)/12n2 


3 to 6 blocks. The suffix 0 denotes the block in question; suffixes 1 and 
— 1 refer to the two neighboring blocks. As before, n; is the number of 
treatments in a basic group. In the last formula, Ds' is the D' for the 
block most distant in the chain from the block in question; e.g., when 
computing the effect of block 2, Dz’ refers to block 5. 

НВ, is the estimated mean of block 1 in the original block numbering, 
the estimated mean for a treatment that occurs in blocks 2 and ів 


‚-@-&-® 


2 


where 7 is the total of the two units which have this treatment. For a 
treatment occurring only in block % 


і-Т-% 


In the analysis of variance, the total sum of squares and the sum of 
squares for blocks are computed in the standard way. Note, however, 
that if blocks contain differing numbers of units, we require a weighted 
sum of squares 

B? с? 
where k; is the number of units in the ith block and В; is the block total. 

Instead of finding the adjusted sum of squares for treatments, we 
compute the error sum of squares directly, obtaining that for treatments 
by subtraction. The error sum of squares is found as follows. For each 
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group of treatments that is replicated twice, form differences between 
the two replicates of each treatment, as illustrated below. 


Diff. 
A; A| di 
Bi В| d 
Сі C| d 
Total d 
Then » E 
1 2 
180-9 
2i 


contributes (ng — 1), or in this case 2, d.f. to the error sum of squares. 
Compute and add these terms over all b treatment groups, obtaining 
b(ng — 1) d.f. One additional d.f. comes from the difference between 
the total (over all bloeks) of the upper groups in the block and the 
corresponding total of the lower groups. The square of this difference 
is divided by 2bng. In table 11,84 the sum of squares for this single d.f, is 
(Ay + By + Cr+ Da + Ea + Fo + Gs + Нз + Ia р 
= D, — В, — Fy — 65 — Н» — I3 — Аз — Вз — Сз) 
18 


The partition of d.f. is as follows. 


d.f. 
Blocks b-—1 
"Treatments 51 
Error N -—b—t4+1=)(m—-1) +1 
Total N-—1 


For t-tests, the variance of the difference between two adjusted treat- 
ment means depends on the number of replications which the treatments 
have, and on the shortest distance between the groups in which they 
occur. The formulae may be written: 


Both treatments with 2 reps.: V(diff.) = һ + t) 82 
2 
: fu 
Both treatments with 1 rep.: V(diff.) = 2)1+ a 8? 
2 


8 
One with 1 rep.; one with 2: V(diff.) = 5 {1 <р 


where s? is the error mean square, Table 11.104 gives the factors f for 
most practical cases. 
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TABLE 11.10& Factors FOR OBTAINING VARIANCES IN CHAIN BLOCK DESIGNS 


Treatments fa Л: Л» 
In same group | 0 0 (b — 1)/3 
1 group apart | (b — 2) (b -- 1) (b — 1)/3 


2 groups apart | (3b — 8) (2b — 4) (5b — 9)/3 
3 groups apart | (5b — 18) (3b — 9) (9b — 25)/3 


In finding the shortest distance apart, number the groups as follows. 


Block $ 2 3... b 
Ф 6 0 QA 
бі 6% 6; Ge 
9 9 9з дь 


where g; stands for the groups of treatments that are replicated once. 
In table 11.7a, for instance, A and J are one group apart, since A is in 
Gs and I in Gy. Similarly A (in G5) and 4 (in 92) are 2 groups apart 
while C (in G;) and Ё (in 91) are considered in the same group. 
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TABLE 11.3 INDEX TO PLANS 


t k т b Af E Plan Type 
2 3 6 1 .67 11.1 У 
3 3 4 2 .89 А у 
2 4 10 1 .62 11.2 у 
3 6 10 3 .83 11.1a У 
4 4 5 3 94 4 у 
2 5 15 1 .60 Б 1 
3 5 10 2 .80 4 ш 
3 10 20 4 .80 1.5 T 
4 10 15 6 .90 6 п 
5 5 6 4 .96 x ‚4 
2 6 21 1 .58 11.2а п 
3 3 7 1 78 11.7 У 
4 4 7 2 88 11.8 v 
6 6 7 5 97 4 у 
2 7 28 1 57 11.9 1, 
4 7 14 3 186 11.10 І 
7 7 8 6 .98 е М 
2 8 36 1 56 11.3а II 
4 8 18 3 84 11.11 п 
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TABLE 11.3 INDEX TO PLANS (Continued) 


ё k r b AT Е Plan Type 
9 5 10 18 5 90 11.12 IT 
6 8 12 5 94 11.13 II 
8 8 9 7 98 * IV 
10 2 9 45 1 56 11.14 I 
3 9 30 2 74 11.15 II 
4 6 15 2 83 11.16 Ш 
5 9 18 4 89 11.17 ш 
6 9 15 5 .93 11.18 ш 
9 9 10 8 .99 s IV 
11 2 10 55 1 55 11.44 II 
5 5 11 2 88 11.19 IV 
6 6 1 3 92 11.20 IV 
10 10 п 9 99 ж ТУ 
13 3 6 26 1 72 11.21 II 
4 4 13 1 81 11.22 IV 
9 9 13 6 96 11.23 IV 
15 3 7 35 1 71 11.24 I 
7 7 15 3 92 11.25 IV 
8 8 15 4 94 11.26 IV 
16 6 6 16 2 89 11.27 IV 
6 9 24 3 89 11.28 II 
10 10 16 6 96 11.29 IV 
19 3 9 57 1 70 11.30 II 
9 9 19 4 94 11.31 IV 
10 10 19 5 95 11.32 IV 
21 3 10 70 1 70 11.33 I 
5 5 21 1 84 11.34 IV 
7 10 30 3 90 11.35 ш 
25 4 8 50 1 78 11.36 П 
9 9 25 3 93 11.37 IV 
28 4 9 63 1 78 11.38 I 
7 9 36 2 89 11.39 ш 
31 6 6 31 1 86 11.40 IV 
10 10 31 3 93 11.41 IV 
37 9 9 37 2 91 11.42 IV 
4l 5 10 82 1 82 11.43 II 
57 8 8 57 1 89 11.44 IV 
73 9 9 73 a .90 11.45 IV 
91 10 10 91 1 .91 11.46 IV 


t Number of times that two treatments appear together in the same block. 

* These plans are constructed by forming all possible combinations of the £ num- 
bers in groups of size К. The number of blocks b serves as a check that no group has 
been missed. 


PLANS 471 


PLANS 
Plan 11.1 t=4,k = 2,т «8,5 = 6, à = 1, Е = 67, Type У 
Block Rep. I Rep. II Rep. III 
Gy 1 2 (3 1 3 (5) 1 4 
Q3 4 (02 4 2 83 


Plan 11.2 t =5,k = 2,т = 4,Ь = 10, = 1, Е = 62, Type V 


Reps. I Reps. Ш 

Block and II and IV 
a) 1 2 (6) І 4 
(2) 3 4 (7) 2 3 
(3) 2 5 (8) 3 5 
(4) 1 3 (9) 1 5 
(5) 4 5 (10) 2 4 


Plan 11.1a i = 5, k =3,r = 6, b = 10, à = 3, Е = .83, Type V 


Reps. I, II, Reps. IV, V, 
Block and III and VI 
(2 123 (6) 124 
(2) 125 -1-3-4 
(8) 145 (8) 1 35 
(4) 23 4 (9) 2 3 5 
(5) 345 (0) 2 4 5 


Plan 11.8 t = 6, k = 2, r = 5,Ь = 15, à = 1, E .60, Туре I 


Block Rep. I Rep. II Rep. III Rep. IV Rep. V 
a) 1 2 (4 1 8. вал 4 (10) 1 5 (3) 1 6 
(2) 3 4 (Б) 2 5 (892 6: (11) 2 4 (14) 2 3 
(8) 5 6 (6 4 6 (9) 3 5 (2 8 6 (5 4 5 


Plan 11.4 = 6,& = 3,7 = 56 = 10, = 2, = 80, Type Ш 


Block 

а) 1 25 (0 234 
0) 126 б 235 
(3) 13 4 (8) 2 4 6 
(4) 136 9 3 5 6 


= 
> 
e 
8 
> 
o 
o 


(5) 


472 INCOMPLETE BLOCK DESIGNS 
Plan 11.5 t = 6, k = 3, r = 10, = 20, à = 4, Е = .80, Type I 


а УМ" 474 () 125 Ф 12% 
( 456 @ 356 (6) 3 4 6 (8) 3 4 5 


07 146 (19) 1 56 
(18) 2 3 5 (2) 2 3 4 


Plan 11.6 t= 6, k = 4,r = 10, b = 15, = 6, Е = .90, Type II 

Block Reps. I and II Reps. III and IV Reps. V and VI 
(D 2 2 3 4 (4) 1 2 3 5 Чу Xd 2 3 6 
(2) 1 4 5 6 (5) 1 2 4 6 (8) 1 3 4 5 
(3 2 3 5 6 (6) 3 4 5 6 (9) 2 4 5 6 


Reps. УП апа УПТ Reps. ІХ and Х 
(10 1 2 4--% а) 1 2 5 6 
an. 3 Shas ЦТ * 8. 4 -$ 


2 2 3 4-5 (15) 2 3 4 5 


Plan 11.2a t=7,k=2,r =6,b = 21, л = 1, Е = .58, Type II 


Reps. I Reps. III Reps. V 
Block and II and IV and VI 
(1) i» @ dp. 8 (5) 1 4 
(2) 23- 6 (9 2 4 (6) 2 3 
(3) 3 4 (10) 3 5 (17) 3 6 
(4) 4 7 (11) 4 6 (18) 4 5 
OL $ 42 5 . 7 (9 2 5 
(0 5 6 a3) 1 6 (20 6 7 
(7) 3 7 (14) 2 7 (21) 1 7 


Plan 11.7 t=7,k=3,r=3,b =7, = 1, Е =.78, Туре V 


Block 
() 12 4 (3) 3 4 6 (5) 5 6 1 0) 271% 
() 2 3 5 (à 4157 (6) 672 


473 


PLANS 
Plan 11.8 t=7,k =4,r=4,b =7,d = 2, Е = 88, Туре V 
Block 
(3567 41236 (023456 
0) 1467 () 2847 py we-z 
912517 (9 1345 
Plan 11.9 { = 8, k = 2,т = Т,Ь = 28, à = 1, = 57, Type I 
Block Вер. I Rep. II Rep. III Rep. IV 
a) 1 2 (5) 1 3 Оте а eds): 1 5 
Q 3 4 (9 5028 = (Nets pot) 2 3 
(8) 5 6 (7) 4 5 (1) 3 6 (15) 4 7 
(4) 7 8 @6 7 8 9 
Rep. У Вер. УТ Rep. УП 
an 1 6 QU Y 7. (95 10108 
(19 2 4 (602 6 (062 5 
19 3 8 03 3.5. ФЕТ 
(20) 5 т 094 8 (28) 4 6 
Plan 11.10 t=8,k=4,r=7,b = 14, à = 3, E = 86, Type I 
Block Rep. I Rep. II Rep. III Rep. IV 
(0 1234 (8) 1278 (5) 1037618 (01467 
(2) 5678 (8486 (2457 (8 2358 
Кер. У Кер. УІ Вер. УП 
0 1256 (ii) 1857’ 03 14525 
(10) 3478 (22468 (14) 2367 
Plan 1а t =9,& = 2,7 = 8, b = 36, à = 1, = 56, Type II 
Reps. I Reps. III Reps. V Reps. VII 
Block and II and IV and VI and VIII 
0 1 2 0013 аў 1 4 (8 1 5 
(2) 2 8 а) 2 5 (20) 2 6 (29) 2 4 
(3) 3 4 (12) 3 6 (21) 90x (30) 3 8 
(4) 4 7 (13) 4 9 (22) 4 5 (в1)24---6 
(5) 5 6 (14) 5 8 (23) 5 7 (32) 3 5 
(6) 1 6 (15) 6 F (006 8 (33) 6 9 
(7) 3 7 (16) 1 7 (25) 7 9 (34) 2 7 
(8 8 3 (04 в 
(9 5 9 a8) 2 9 (27) 3 9 (36) 1 9 


| 


474 INCOMPLETE BLOCK DESIGNS 
Plan 11.11 ¢=9,k = 4,т = 8,b = 18, à = 3, Е = .84, Type II 


Reps. I, II, Reps. V, VI, 
Block III, and IV VII, and VIII 
(01467 00) 125 7 
(2) 2689 (il) 2356 
(3) 1389 (12) 3 4 7 9 
(0122384 (13) 1 2 4 9 
51578 (14) 1 5 6 9 
(0 4 5 6 9 (1 13 6 8 
( 273 6:7 (16) 4 6 7 8 
(8) 2458 (17) 3458 
(9 3579 8 27 8 9 


Plan 11.12 1= 9,К =5,r = 10,b = 18, л=5, Е = .90, Type II 


Reps. I, II, Reps. VI, VII, 

Block III, IV, and V VIII, IX, and X 
(1), 1'2-8—7? 8 40.128 5 9 
(2) 12468 45-1 2 5 6 8 
8) 23589 (12) 13 4 5 6 
(4) 23469 (1) 2 3-4 7 8 
(5) 18457 (4) 24579 
(0 24567 (1) 8 5 67 8 
() 18679 (16) 14789 
(8) Зо 07 346 89 
(9) 56789 (8) 12670 


Plan 11.18 t =9,k = 6,7 = 8,0 = 12, = 5, Е = .94, Type II 


Block Reps. I and II Reps. ІП and IV 
0124578 @125679 
(0) 235689 (5) 134589 
8) 134679 (6) 23467 8 

Reps. V and VI Reps. VII and VIII 
(9,4 8 5.6 7.8 00 456739 
8124689 (10,1 24.8 4 5 6 
(9.2 3 4.5 7.9 (12)21,2.3 7 8—9 


PLANS 475 
Plan 11.14 t=10,k =2,7=9,b = 45, X = 1, E = .56, Type I 
Block Rep. I Rep. П Вер. Ш Вер. ТҮ Вер. У 
ш 1 2 6 1 38 ad à 4 (93 5 QD! 6 
Q3 4 m 2 710 919) % пот) 22587 (0) 2 9 
956 4 8 (18) 3 7 (8) 3 10 (23) 3 8 
OT 8 () 5 9 (14 5 8 (19) 4 9 (24) 4 10 
© 9 10 Q0 6 10 (969 (9067 (095 7 
Rep. VI Rep. VII Rep. VIII Rep. IX 
Q 1 7 (31) 1 8 Q9 1^9 (4) 1 10 
Qn 2 6 (32) 2 3 (72 4 (42) 2 5 
(8 3 9 (33) 4 6 (935 (393 6 
Q9 4 5 (3) 5 10 (39 6 8 (44) 4 7 
00 8 10 (59 7 9 (40) 7 1 (45) 8 9 
Plan 11.15 4=10,& =3,r=9,b = 30, X = 2, E = 74, Type П 
Reps. I, Reps. IV, Reps. VII, 
Block II, and III V, and VI VIII, and IX 
012 8 (а) 12 4 (2) 13 5 
(2) 25 8 (12) 23 6 (92) 2 0.7 
() 87% 7 (13) 3 4 8 (23) 88 9 
0146 (14) 45 9 (24) 2 4 10 
() 57 8 (15) 15 7 (25) 3 5 6 
(0 4 6 9 (16) 6 8 9 (26) 1 6 8 
m i139 Q7 3 7 10 (7) 27 9 
(8 2 8 10 (18) 1 8 10 (28) 17 8 
(9) 3 9 10 (19) 25 9 (29) 1 9 10 
(10) 5 6 10 (20) 6 7 10 (30) 4 5 10 
Plan 1116 t= 10, k = 4, r = 6, b = 15, à = 2, E = 83, Type Ш 
Block 
() r2 554 (0 1 6 8 10 (1) 3 5 9 10 
20125 6 0236 9 12 3 6 7 10 
ON Lars (8 24710 (13) 345 8 
(4) 1 4 9 10 (0) 25810 (19 456 7 
(б) 157 9 00 278 9 (15) 468 9 
Plan 11.17 i = 10, k= 5, т =9,b = 18, à = 4, Е = 89, Type Ш 
Block 
GQ 123 4.5 (57145 6 10 аз) 2 5 6 8 10 
0) 1236 17 (8 148910 (4) 267910 
(812469 (0157910 (15) 846710 
() 1257 8 (Q0 234810 (16) 3457 9 
@ 1727680 ЗБ оО? 
(6) 13 7 8 10 (9 2 47182970 4567 8 


| 


476 INCOMPLETE BLOCK DESIGNS 


Plan 11.18 t =10,k = 6,7 = 9,0 = 15,А = 5, Е = .93, Туре Ш 
Block 


() 2 4:5 8: 9 (6) 23 4 6 8 10 (11) 14578 
(2) 5 6 7 8 9 10 m 1267910 (2) 123 5 7 
(3) 2456910 (80913568 9 (13) 23567 
@ @ 2 4:0 728 (9 123 8 9 10 44 13456 
() 18 4 7:8 9.10. (0) 23457 9 (15) 13467 


Plan 1148 t= 11, k = 2, r = 10, b = 55, à = 1, E = .55, Type II 


Reps. I Reps. ПІ Reps. V Reps. VII 

Block and II and IV and VI and VIII 
() 1 2° (9 3 (23) 1 4 (34) 1 5 (45) 
@2 u (3) 2 $6 192 3 (3) 2 9 (49 
(3) 3 10 (4) 3 5 (5 3 7 (3) 3 6 (47) 
(4) 4 5 (15) 4 10 (920 4 6 (37) 2 4 (48) 
(5) 5 6 (16 5 9 (27) 5 10 (38 5 7 (49) 
(60 6 7 (7) 6 8 (28 6 9 (39) 6 10 (50) 
(7) 1 7. 041089) “2 * 9029) T tt (40 7 8 (51) 
(8 3 8 (9 1 8 (302 8 (41) 4 8 “(52) 
(9 4 9 (0 7 9 (1) ! 9 (2 9 п (5%) 
00 9 10 (2) 10 п (32) 8 10 (43) 1 10 (54) 
(п) 8 E11 90(99) T G3 5 п (44) 3 пи (55) 


Plan 11.19 t= 11,k = 5,r = 5,Ь = А = 2, Е = 48, Туре IV 


Block 
(9122787 5 СВ ry 7 800113 
(2) 234 6 9 (8 8 910 1 4 
() 845 7 10 (9), .9. 10..11,,2. 6 
(0456 81 Q0* 10-116 1-3 6 
(5) 567 9 1 MYM ade 97: 4 7 
(6) 6 78 10 2 


Plan 11.20 t=1,k=6,r=6,b = 11,\ =3, Е = 92, Type IV 


Block 

Q 4679 10 it (0 1-2 4 5 6 1 
@ 1 5 7.8 10 11 8) 235671 
(8) 1268 91 9% 13467 
4) Р 27 307 910 00 24578 
(5) 234 8 101 anes 506 8 9 1 
6) 1345 эп 


PLANS 477 


Plan 11.21 i= 13, k = 3,r = 6,b = 26, ^ = 1, E =.72, Type II 
Reps. I, Reps. IV, 
Block II, and Ш V, and VI 
01 3 9 à 2 5 6 
(2) 2 4 10 (15) 36 7 
3)3 51 (10) ay 18 
(4) 4 6 12 an 5 8 9 
(5) 5 7 13 (18 6 9 10 
(6) 1 6 8 (19) 7 10 1 
(02.7 9 (20) 8 1 12 
(89 3 8 10 (21) 9 12 13 
09 а өп (22) 1 10 13 
(0) 5 10 12 өзі 21 
ay 6 11 13 (24) 2 3 12 
(19у 1757212 (25) 3 4 13 
(13) 2 8 13 (60 1 4 5 
Plan 11.22 t= 13, k = 4,r = 4, b = 13,А = 1, E = 81, Type IV 
Block 
() 12 4 10 9 6 792 нЕ? 
203351 ( 7 8103 (2 12 13 2 8 
@ 34 6 12 (8 8 9114 Q9 13 139 
(4) 4 5 7 13 (9 9 10 12 5 
568 1 (10) 10 H 13 6 
Plan 11.28 t= 13, k = 9, r = 9, b = 13, à = 6 E = 96, Туре IV 
Block 
() 85678 9 1l 12 J8 () 123 5 6 7 10 12 13 
0 14678 9 10 12 13 0) 123467 8 1 18 
@ 12 57 8" 9 0 Ji 38 (0 2 3. 45/7 8 9 12 
(123870 89 10 H3 (1) 234568 9 10 13 
( 23 4 7 9 10 1 12 18 12 134567 9 101 
(0 13 4 5 8 10 ll 12 13 (8) 2 45 6 7 8 10 1 12 
(012456 ош 12 18 


Вер. ТҮ 
16) 1 8 9 
(17) 2 13 15 
089 3 4 7 
(19) 5 14 
(20) 6 10 12 


Reps. VII, VIII, 


478 INCOMPLETE BLOCK DESIGNS 
Plan 11.24 t= 15,6 =3,r =7,b = 35, = 1, Е = .71, Type I 
Block Rep. I Rep. П Rep. III 

i-e 8 (0 1 4 5 С) "6^7 

(2.4 8 12 (02 810 (12) 2 9 11 

(3 5 10 15 (8 3 13 14 (13) 3 12 15 

(4) 6 1 13 (9.75 "9 15 (14 4 10 14 

() 7 9 14 (0) 7 11 12 (15) 5 8 13 

Rep. V Rep. VI Rep. VII 

(21) 1 10 11 (20 1 12 13 (31) 1 14 15 
(22) 2 12 14 (7) 2"5 7 (32 2 4 6 
Q3 3 5 6 (28) 3 910 (33) 3 8 1 
(24 4 9 18 (29 4 11 15 (34) 5 9 12 
(25) 7 8 15 (30 6 8 14 (35 7 10 13 
Plan 11.25 t=15,k = Т, т =7,b = 15, à = 3, Е = .92, Type IV 

See incomplete latin squares Plan 13.7; randomize units in blocks ignoring repli- 
cations. 
Plan 11.86  t—15,k = 8,7 = 8,b = 15, = 4, = .94, Type IV 

See incomplete latin squares Plan 13.8; randomize units in blocks ignoring repli- 
cations. 
Plan 11.27 t= 16,k = 6,r = 6,Ь = 16, X = 2, = .89, Type IV 

See incomplete latin squares Plan 13.9; randomize units in blocks ignoring repli- 
cations. 
Plan 11.28 1 = 16, k =6,r = 9, b = 24, à = 3, Е = 389, Type II 

Reps. I, II, Reps. IV, V, 

Block апа III and VI 


(012 5 61112 
(234 7 8 910 
(3596 910 13 14 
(4) 7 8 11 12 15 16 
(5 I 2 9 10 15 16 
(6) 34 11 12 13 14 
(012 7 81314 
(834 5 61516 


(913 6 81315 
(1024 5 71416 
(1D 57 9 11 13 15 
(12) 6 8 10 12 14 16 
324 6 8 91 
(14) 13 5 7 10 12 
(15) 2 4 10 12 13 15 
(16) 13 9 11 14 16 


and IX 
(17) 14 5 81011 
(1823 6 7 912 
(19) 8 8 9 12 13 16 
(2014 6 713 16 
(2 14 9 12 14 15 
(22) 6 7 10 11 14 15 
(23) 2 3 10 11 13 16 
(24) 23 5 814 15 


PLANS 479 
Plan 11.29 t= 16, k = 10, r = 10, = 16, à = 6, E = .96, Type IV 


See incomplete latin squares Plan 13.10; randomize units in blocks ignoring repli- 
cations. 


Plan 11.90 t= 19, k = 3,r = 9, = 57, à = 1, E = 70, Type II 


See extended incomplete latin squares Plan 13.15a; randomize units in blocks ig- 
noring replications. 


Plan 11.31 t= 19, k = 9, r = 9, b = 19, à = 4, Е = 94, Type IV 


See incomplete latin squares Plan 13.11; randomize units in blocks ignoring repli- 
cations. 


Plan 11.32 i= 19, k = 10,7 = 10, b = 19, = 5, E = .95, Type IV 


See incomplete latin squares Plan 13.12; randomize units in blocks ignoring repli- 
cations. 


Plan 11.38 { = 21, k = 3, r = 10, b = 70, à = 1, = 70, Type I 


Block Rep. I Rep. II Rep. II Rep. IV Rep. V 
() і 23 (8 1415 (15) 1 517 (22) 1 6 9 (29) 1 721 
(2) 4 5 6 (9) 2 511 (16) 2 4 14 (23) 2 716 (30) 2 18 17 
3 7 8 9 (10) 3 916° (17) 3 711 (24) 3 821 (31) 81018 
(4) 10 11 12 (1) 61720 (18 610 19 (25) 41719 (82 4 811 
(5) 131415 (12 71219 (19) 81620 (26) 51013 (38) 51619 
(6) 16 17 18 (13) 81318 (20) 915 18 (27 1115 20 (84 6 12 15 
(7) 192021 (14 101421 (21) 12 1321 (28) 12 14 18 (35) 9 14 20 


Rep. VI Rep. VII Rep. VIII Rep. IX Rep. X 
(86) i 810 (48) 11118 (50) 11220 (5) 11319 (64) 1 14 16 
(87) 21819 (4) 21020 (51) 2 6 8 (59 2 9 12 (65) 21521 
(88) 31517 (45) 3 512 (52) 31419 (59) 3 4 20 (66) 3 613 
(39) 41216 (46) 4 918 (53 41821 (60) 5 814 (67) 4 7 10 
(40) 5 921 (47) 61621 (54 5 715 (6) 6 718 (68) 5 18 20 
(41) 61114 (48) 71417 (5) 91017 (62 10 1516 (69) 81217 
(42) 71320 (49 81519 (66) 111316 (63) 11 1721 (70 9 11 19 


Plan 11.84 t= 21, = B, r = 6, b= 21, à = 1, E = М, Type IV 


See incomplete latin squares Plan 13.13; randomize units in blocks ignoring repli- 
cations, 


INCOMPLETE BLOCK DESIGNS 


t= 201, Е =7,r = 10, = 30, à = 3, E = .90, Type III 


480 
Plan 11.85 
Block 
(1) 2 5 10 n 17 19 20 
(2) 3 6 11 12 18 20 21 
(3) 4 7 12 18 19 21 15 
(4) 5 1 13 14 20 15 16 
(5) 6 2 14 8 21 16 17 
(0 73 8 9 15 17 18 
( 14 9 10 16 18 19 
(8 34 S 13 17 19 20 
(9 4 5 9 14 18 20 21 
(10) 5 6 10 8 19 21 15 
(4) 6 7 11 9 20 15 16 
(12) 7 1 12 10 21 16 17 
(13) 1 2 13 11 15 17 18 
(14) 2 3 14 12 16 18 19 
05 16 9 12 17 19 20 
Plan 11.36 


See extended incomplete latin squares Plan 13.168; randomize units in blocks ig- 


(16) 
(17) 
(18) 
(19) 
(20) 
(21) 
(22) 
(23) 
(24) 
(25) 
(26) 
(27) 
(28) 
(29) 
(30) 


2 7 10 13 18 2 21 
3 1H 1 19 21 15 
4 2 12 8 20 15 16 
5 3 13 9 21 16 17 
6 4 14 10 15 17 18 
7 5 8 16 18 19 
1-2 4 8 өш 
28 Б 9 10 12 15 
34 6 10 H 13 16 
45 71 12 14 17 
5 6 1 12 13 8 18 
6 7 2 13 14 9 19 
71 3 14 8 10 20 
12 8 4 5 6 T 
8 9 10 11 12 13 M 


t = 25, k = 4,т = 8, b = 50, à = 1, Е = 78, Type II 


noring replications. 


Plan 11.37 


See incomplete latin squares, Plan 13.1а; randomize units in blocks ignoring rep- 


lications, 


t = 25, k = 9, r = 9, b = 25, à = 3, Е = .93, Type IV 


PLANS 48) 
Plan 11.38 i = 28, k =4,r = 9, b = 63, X = 1, = 78, Type I 
Block Rep. I Rep. II Rep. Ш 
d) 28 1 10 19 @ 28 2 п 20 (5) 28 3 12 21 
Q 2 9 13 16 9 3 11117 (б) 4 2 15 18 
з) 3 8 H 18 (10) 4 9 12 10 а) 5 13 11 
à 4 723 2 (1 5 8 24 25 (8 6 9 25 26 
() 5 6 20 27 a) 6 7 21 19 19 7 8 22 20 
(6) 12 17 22 25 (13) 13 18 23 26 (20) 14 10 24 27 
(7) 14 15 21 26 (14) 15 16 22 27 (21) 16 17 23 19 
Rep. IV Rep. V Rep. VI 
(22) 28 4 13 22 (29) 98 5 14 23 (36) 98 6 15 24 
(23) 5 3 16 10 в) 6 417 H ТӘЙ е В 12 
(4) 6 2 14 12 (8) 7 3.35.18 (388 8 4 16 M 
(5 7 1 26 27 (82) 8 2 27 19 (39 9 3 19 20 
(б) 8 9 23 21 (33) 9 1 24.22 (40) 1 2 25 23 
(27) 15 11 25 19 (34) 16 12 26 20 (а) 17 13 27 21 
(28) 17 18 24 20 (35) 18 10 25 21 (42) 10 11 20 22 
Rep. VII Rep. VIII Rep. IX 
(43) 28 7 16 25 (50) 28 8 17 26 (57) 98 9 18 27 
(4) 8 6 10 13 (1) 9 71M (58 1 8 12 15 
(4) 9 517 15 (52) 1 6 18 16 (9) 2 7 10 17 
(4) 1 4 920 21 (53) 2 5212 (6) 3 6 22 23 
(а) 2 3 26 24 (4) 8 4 27 95 (6) 4 5 19 26 
(4) 18 14 19 22 (55 10 15 20 23 (62) 11 16 21 24 
(49) 11 12 27 93 (6) 12 13 19 24 (03) 18 14 20 25 


482 INCOMPLETE BLOCK DESIGNS 


Plan 11.89 1 = 28,k = 7,7 = 9,Ь = 36, à = 2, E = .89, Type ПТ 
Block 


(D 478954925. (19) 4 8 Il 17 19 21 25 
0 1 5 9 10 11 15 24 (20) 1 13 14 18 23 25 26 
( 6 8 13 15 16 18 21 (21) 2 4 5 6 16 2 23 
(4) 7 12 13 17 22 24 25 (22) 3 4 10 П 12 14 18 
(5) 4 10 16 17 20 26 27 (23) 1 9 14 16 17 19 22 
(0 2 18 19 22 26 28 (2%) 1 2 4 13 20 24 98 
() 1 8 $6 12 19 28 27 (25) 3 5 8 17 28 24 926 
Я 2 3 5 14 20 21 25 (29) 5 6 7 10 19 25 28 
() 1.2 8 10 12 16 2 (7) 1 6 7 8 li 2 26 
(10 2 3 6 9 1 13 17 (28) 9 10 13 19 20 21 23 
(1) 4 5 12 13 15 19 26 (29) 2 8 14 15 19 24 27 
(12 3 7 16 18 19 20 24 (9) 3 9 15 16 25 26 28 
(3) 6 10 14 21 22 24 26 (31) 5 8 9 12 18 20 22 
(14) 11 15 20 22 23 25 27 (82) П 12 16 21 23 24 28 
{ИБ 177 187 21701128. -(3 1 3 4. 7 15 21 22 
а) 2 7 9 12 21 2% 27 (М 5 7 M 13 14 16 27 
а) 3 8 10 13 22 27 28 (35 4 6 9 18 24 25 27 | 
(8) 6 12 14 15 17 20 28 (%) 2 7 10 15 17 18 23 


Plan 11.40 4-31,% =6,r = б,Ь = 31, à = 1 E = .86, Type IV 

See incomplete latin squares Plan 13,14; randomize units in blocks ignoring rep- 
lications. 
Plan 1141 4= 31, k = 10,7 = 10, b = 31, à = 3, E = .93, Type IV 

See incomplete latin squares Plan 13.2a; randomize units in blocks ignoring rep- 
lications. 
Plan 11.42 t=37,k =9,r =9,b = 37, = 2, Е = .91, Type IV 

See incomplete tin squares Plan 13.15; randomize units in blocks ignoring rep- 
lications. 
Plan 11.48 t= 41, k =5,r = 10, b = 82, à = 1, E = 82, Type Ш 

See extended latin squares Plan 13.17a; randomize units in blocks ignoring rep- 
lications. 
Plan 11.44 {= 57,k =8,r = 8, b = 57, = 1, Е = .89, Type IV 

See incomplete latin squares Plan 13.38; randomize units in blocks ignoring rep- 
lications. 
Plan 11.45 t= 13, К = 9,7 = 9, b = 73, à = 1, = .90, Туре IV 

See incomplete latin squares Plan 13.4a; randomize units in blocks ignoring rep- 
lications. 
Plan 11.46 t= 91, k = 10, r = 10, b = 91, à = 1, E = .91, Type IV 

See incomplete latin squares Plan 13.5a; randomize units in blocks ignoring тер- 
lications. 


CHAPTER. 12 
Lattice Squares 


12.1 Description 


19.11 Balanced Lattice Squares. The number of treatments must be 
an exact square. Within each replicate, the k? treatments are arranged 
on the plan in a k X k square. The method of grouping into rows and 
columns, which varies in successive replications, is such that the treat- 
ment means can be adjusted for differences among the rows and columns 
of each square. Thus, in addition to the elimination of differences among 
replicates from the experimental errors, the design permits a “double 
control” within each replicate, similar to that obtained in a latin square. 

The principle which governs the grouping into rows and columns may 
be seen from the plans 12.1-12.8. With 9 treatments, for instance, treat- 
ment 1 occurs in the same row as treatments 2, 3, 6, and 8, and in the 
same column as treatments 4, 5, 7, and 9. Thus every other treatment 
appears with treatment 1 either in the same row or in the same column. 
More generally, any pair of treatments occurs once in the same row or in 
the same column. 

This property holds for all plans having an odd number of treatments, 
i.e., for 9, 25, 49, 81, 121, and 169 treatments. The number of replicates, 
for k? treatments, is (k + 1)/2. The standard error of the difference is 
not exactly the same for all pairs of treatments, though the variation in 
accuracy is small. Separate formulae are given for the two standard 
errors for cases in which they may be needed. 

A design with twice the basic number of replicates (e.g., 25 treatments 
in 6 replicates) is obtained by repeating the plan, the rows being inter- 
changed with the columns. In designs so produced, every pair of treat- 
ments occurs together once in the same row and once in the same column. 
On account of this property, the standard error of the difference is the 
same for all pairs of treatments, whether the row and column differences 
are large or small. 

Within the most useful range, the only even numbers of treatments 
which provide lattice square designs are 16 and 64. These designs have, 
respectively, 5 and 9, ie. (k + 1) replicates. Every pair of treatments 
occurs once both in the same row and in the same column. à 

The available selection of designs (up to 12 replicates) is summarized 
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in table 12.1. By analogy with balanced lattices, the designs may be 
called balanced lattice squares. 


TABLE 12.1 AVAILABLE NUMBERS OF TREATMENTS AND REPLICATIONS FOR 
BALANCED LATTICE SQUARES 


Number of treatments 9 16 25 49 64 81 121 169 
Number of replicates 48 5,10 3,6 48 9 5,10 6,12 7 


For 16 treatments in 10 replicates, the basic plan is used twice. Al 
though the basic plan (plan 12.1) requires only 2 replicates, the design 
for 9 treatments is not recommended with less than 4 replicates. Even 
in this case there are only 8 d.f. for estimating row and column variances 
and only 8 d.f. for error. Little would be gained over randomized blocks 
unless the extra controls were highly effective. A design with 9 treat- 
ments and 8 replicates is found by repeating the design for 9 Б 
and 4 replicates. 

Certain factorial experiments can be arranged in balanced lattico 
squares; e.g., 3 X 3,8 X 2,4 X 44 X 2X2, 21,5 X 5,7 X 7,8 X8, 
43, 28,9 X 9, 3*. All main effects and interactions are confounded to the 
same extent with rows and columns. 


12.12 Partially Balanced Squares. For a balanced design, the availa- 
ble numbers of replications are rather severely restricted. Although de- 
signs are possible for other numbers of replicates, they lack the symmetry 
of the designs described above, with the consequence that the statistical 
analysis usually becomes more complicated. It happens, however, that, 
if the number of replications is less than that required for a balanced de- 
sign, the statistical analysis follows the same procedure as for balanced. 
designs, apart from minor changes in some formulae. In experimenta- 
tion where double grouping has proved effective but where the replica- 
tions are not sufficient for a balanced lattice square, the additional de- 
signs shown in table 12.2 may be useful. 


TABLE 12.2 OrHER LATTICE SQUARE DESIGNS 


Number of treatments 49 ` 64 81 121 169 
Number of replicates 3 34 34 8/46 3, 4, 5, 6 


When the number of-treatments is odd, the plans for these designs аге 
obtained simply by taking the desired number of replicates from the 
plans for the balanced designs. Thus, for 49 treatments in threefold 
replication, we use squares I to Ш from plan 12.4. With 64 treatments, 
squares I, III, and V are used for 3 replicates, and squares I, III, V, and 
VII for 4 replicates. 

Partially balanced lattice squares are analogous to the triple, quad- 
tuple, and quintuple lattices. 
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12.13 Arrangement of Experimental Material. The method of ar- 
ranging the experimental material is similar to that for an ordinary latin 
square. With k? treatments, the units are arranged іп k X k squares so 
that the rows and columns of each square correspond to the two types of 
variation whose effects we wish to eliminate from the errors. In field ex- 
periments, the plots of each replicate are usually laid out in square forma- 
tion, in which case row and column differences represent fertility vari- 
ations in two directions at right angles to each other. If the width of a 
greenhouse bench accommodates 5 pots, 25 treatments can be laid out in 
replicates which consist of 5 rows of 5 pots each. 


12.14 Randomization. Within each replicate the rows and columns 
of the basic plan should be permuted separately at random before apply- 
ing the treatments. It is also advisable to assign the treatments at ran- 
dom to the treatment numbers in the plan. 


12.2 Statistical Analysis 


12.21 Designs with (k + 1)/2, or Fewer, Replications. The same in- 
structions apply to the partially balanced designs and to the balanced 
designs with (k + 1)/2 replications. Table 12.3 shows the plan and 
yields for a lattice square with 25 corn hybrids in 3 replications, con- 
ducted by the North Carolina Experiment Station in 1942. Each plot 
contained 4 rows, with 10 hills per row. 

The computations proceed as follows. The number of treatments is 
12, and the number of replications r. 

1. Find the row, column, treatment, replication, and grand totals. 
The treatment totals are placed in a square under the plan. 

2. For each row, calculate the L value, where 


L = total (from all replications) of all treatments included in the 
row — r times row total Г 
eg., 


1 = 89.8 + 93.3 + 82.1 + 88.9 +91.1 — 3(159.1) = —32.1 
As a check, the total of the L values over a replication equals 


(Grand total) — r(replication total) 
Thus, 
—76.2 = 2189.4 — 3(755.2) 
Similarly for each column we obtain M values. 


M = total (from all replications) of all treatments included in the 
column — r times column total 


TABLE 12.3 YIELDS (POUNDS CORN PER PLOT) FOR А 5 X 5 LATTICE SQUARE IN 
3 REPLICATIONS 


(18) 
(24) 
(12) 
(6) 
(5) 


33.3 
24.6 
28.5 
26.7 
40.1 


Total 153.2 


M —16.0 
e — 0.83 


(20) 
(15) 
(25) 

(5) 
(10) 


30.9 
37.2 
32.7 
32.0 
39.8 


Total 172.6 


M — 


62.2 


e — 822 


(19) 
(11) 
(22) 
(5) 
(&) 


28.7 
19.4 
18.3 
30.2 
34.4 


Total 131.0 


M +86.0 
в + 445 


(1) 
(6) 
(11) 
(16) 
(21) 


(1) 
(6) 
(11) 
(16) 
(21) 


Treatment numbers are shown in parentheses 


Square I Total L 5 
(9) 30.7 (11) 35.4 (2) 301 (25) 29.6 | 1591 —321  —258 
(15) 30.8 (17) 28.8 (8) 34.8 (1) 32.5 | 151.5 -105  —084 
(8) 24.0 (10) 28.4 (21) 25.0 (19 35.1 | 1410 +224 4-180 
(22) 27.2 (4) 25.6 (20) 250 (13) 29.4 | 133.9 +110 +088 
(16) 35.7 (23) 30.1 (14) 30.3 (7) 33.5 | 169.7 -670 —5.39 
148.4 148.3 145.2 160.1 | 755.2 —6.13 
438  -—128 -100 -412 —76.2 
+0.20 - 066 – 052 — 213] —3.94 
Square II 
(17) 33.3 (19) 38.8 (16) 27.7 (18) 34.4 | 165.1 — 45.9 — 3.69 
(12) 31.2 (14) 27.9 (11) 27.3 (13) 21.6 | 1452 - 58 — 047 
(22) 43.0 (24) 28.5 (21) 24.7 (23) 22.7 | 151.6 - 462 — ЗТ 
(2) 32.8 (4) 31.8 (1) 287 (3) 32.3 | 1576 - 338 - 272 
(7) 37.3 (9) 31.9 (6) 34.0 (8) 34.3 | 177.3 - 69.3 — 5.57 
177.6 158.9 142.4 145.3 | 7968 — 16.16 
—855 -301 -126 -106 -201.0 
= 448 — 156 -065  — 0.55 | —10.41 
Square III 
(15) 26.3 (23) 217 (6) 21.9 (2) 26.0 | 1246 + 69.1 + 5.56 
(7) 17.3 (20) 16.9 (3) 22.6 (24) 24.2 | 100.4 + 980 + 7.88 
(18) 22.1 (1) 17.5 (14) 25.0 (10) 26.9 | 109.8 +105.9 + 851 
(21) 27.5 (9) 30.7 (17) 281 (13) 27.6 | 1441 + 9.3 + 0.75 
(4) 32.8 (12) 31.9 (25) 28.8 (16) 30.6 | 1585 — 51 — 041 
126.0 118.7 126.4 1353 | 637.4. 422.29 
+616 +548 +46.8 --280 +277.2 
+ 3.19 + 284 +4242 + 1.45 | 414.35 
Grand total | 2189.4 0 0 
Treatment totals 
(2) 88.9 (3) 78.9 (4) 90.2 (5) 102.3 
(7) 881 (8) 103.5 (9) 933 (10) 95.1 
(12) 916 (13) 786 (14) 832 (15) 943 
(17) 90.2 (18) 89.8 (19) 102.6 (20) 72.8 
(22) 88.5 (23) 74.5 (24) 773 (25) 911 21894 
Adjusted treatment totals 
(2 85.7 (3 87.9 (4) 88.9 (5) 95.3 
(7) 81.6 (8) 1001 (9) 874 (10) 974 
(12) 901 (13) 78.5 (14) 862 (15) 987 
(17) 83.8 (18) 93.8 (19) 1070 (20) 77.0 
(22) 944 (23) 726 (24) 797 (25) 81.5 
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3. We now compute the analysis of variance. The total s.s. and the 
sums of squares for replications and treatments are found in the usual 
way. If the symbol L, denotes a replication total of the L’s, the sum of 
squares for rows within replications, adjusted for treatments, is 


yn bu 
krr— 1)  KEr(r—1) 
E (32.1)? + (10.5)? +- - -+ (5.1)? "i (76.2)? + (201.0)? + (277.2)? 
30 150 
= 1405.95 — 820.32 = 585.63 
The sum of squares for columns, eliminating treatments, is 
(16.0)? + (3.8)? +- - -+ (28.0)? E (76.2)? + (201.0)? + (277.2)? 
30 150 


= 1058.53 — 820.32 = 238.21 
The complete analysis of variance is shown in table 12.4. 


TABLE 12.4 ANALYSIS OF VARIANCE FOR A 5 X 5 LATTICE SQUARE 


d.f. 8.8. m.s. 
Replications (r — 1) 2 546.88 273.44 
Treatments (i — 1) 24 611.09 25.46 
Rows (adj.) r(k — 1) 12 585.68 48.808, 
Columns (adj.) r(k — 1) 12 238.21 19.852, 
Error (k = По —r—k—1) 24 229.79 9.57Е, 
Total (rk? — 1) 74 2211.60 


4. This step leads to the adjusted treatment means. Let 
E, = mean square for rows 
Е, = mean square for columns 
Е, = mean square for error 


(Е, — E) 48.80 — 9.57 _ ам 

k(r — 1)E, (5)(2)(48.80) 

Ж (Е,- Е) z 19.85 — 9.57 _ атал 
k(r— ПЕ. (5)(2)(19.85) 


ЕЕ, or Е, is less than E, then Х or p’, as the case may be, is taken as 0. 
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5. Multiply the L’s by Х to give 5 values, and the M’s by p’ to give 
« values, as presented in table 12.3. 

6. The adjusted total for any treatment is secured by adding to the 
unadjusted total the à and е values for each row and column in which 
the treatment appears. For treatment (1), we have 


78.7 + (—0.84) + (—2.13) + (—2.72) + 
(—0.65) + (8.51) + (2.84) = 83.7 


To obtain the treatment means (not shown) we divide by the number 
of replicates. 

7. The average variance of the difference between two adjusted treat- 
ment means is 


2E, rk 2(9.57) | (3) (5) | 

кер ONG em ; 

т [ ате) re ee 
= 8.49; в.е. = 2.91 


Except perhaps with the 5 Х 5 and smaller squares, it is sufficiently ас- 
curate to use this figure for comparisons between any pair of treatments. 
More accurately, for two treatments that appear in the same row, the 
variance of the difference is 


2Е, 

ТЕ + (r — 1) + r4] = 8.40; s.e. = 2.90 
For two treatments in the same column. 

2Е, 

— + rv’ + (т — 1] = 8.58; se. = 2.93 


Clearly the average error is good enough in this experiment. For the 
partially balanced designs one more formula is needed, since some pairs 
of treatments do not occur together either in a row or in a column. The 
variance of the difference is 


2Е, 
ie [o rU +z’) 


8. As is typical of these designs, the analysis does not provide an exact 
F-test. The treatments m.s. in the analysis (table 12.4) cannot be com- 
pared with the error m.s., since the former has not been adjusted for row 
and column effects. For an approximate F-test, compute the sum of 
squares of deviations of the adjusted treatment totals in table 12.3. This 
comes to 1606.05. Division by 3, since there are 3 replicates, gives 
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535.35, with a mean square of 22.31. This may be compared with the 
effective error m.s., which is 


rk 15 
n| 1 +— QN +) | = 9.57 |1 + — (0. = 
z| +“ zl 5 [ +> (0.1329) | 12.73 


The F-ratio is 22.31/12.73, or 1.75, with 24 and 24 d.f. 

9. The gain in precision relative to randomized blocks is estimated as 
follows. If the experiment were analyzed as randomized blocks, rows 
and columns would be amalgamated with error, giving 


d.f. 8.8. m.s. 
Replications 2 546.88 
Treatments 24 611.09 
Error 48 1053.63 21.95 


This error, 21.95, is compared with the effective error m.s., 12.78. The 
relative information is estimated as 21.95/12.73, or 172%. Thus 3 
replicates of the lattice square appear about as precise as 5 with ran- 
domized blocks. 

Often a comparison with a lattice design will be of more interest. This 
experiment could have been planned as a triple lattice, and in that event 
the rows would probably have been chosen as blocks. The intra-block 
error would be derived from the pooled sum of squares for columns and 
error in table 12.4. "Тһе mean square would be 13.00, with 36 44. Con- 
sequently, for the triple lattice we have Ер = 48.80 (m.s. for rows), E, = 
13.00. The effective error m.s. (section 10.28) is 


AE + em = z [1 + СЫ 


3(48.80 — E UM 


mien Е * ^ @) 4880) 


The relative accuracy of the lattice square to the triple lattice is 
estimated as 15.38/12.73, or 121%. E 

An account of the theory and a worked example are given by Yates 
(12.1) for balanced squares and by Cochran (12.2) for partially balanced 
Squares. 


12.22 Designs with (k + 1) Replications. In this case the analysis is 
slightly different, since it is possible to apply & single adjustment for all 
the rows in which a treatment lies, instead of making a separate adjust- 
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TABLE 12.5 PERCENTAGES OF SQUARES ATTACKED BY BOLL WEEVILS FOR А 
4 X 4 LATTICE SQUARE IN (К + 1) REPLICATIONS 


Boll weevil infestation 


Square I totals 


(10 290 (2 203 (9) 177 (11) 263 73.3 
(2) 47 (4) 90 а) 73 (3) 83 29.3 
(14) 90 (16) 67 (18 117 (5) 43 31.7 
(69 40 (8) 5.0 (5) 57 (7) 14.3 29.0 


Column totals 26.7 41.0 42.4 53.2 | 163.3 


(5) 190 (1) 87 (15) 180 (2) 157 | 564 
(10) 120 (7 60 (4) 153 (13) 120 | 453 
(16) 127 () 63 (6 17 (1) 130 | 337 
8 37 (14 37 © 80 (8) 133 | 287 


Column totals 47.4 24.7 38.0 54.0 | 1641 


(10) 170 (15) 70 (8 103 (1) 13| 356 
(9) 113 (16) 123 (7) 30 (2) 53| 319 
(12) 123 (9 87 (6 80 (3 93| 383 
(11) 303 (1) 223 (5 110 (4 127 | 763 


Column totals 70.9 50.3 32.3 28.6 | 1821 


(16) 50 (12) 103 (8) 57 (0 127 | 337 
(а) 27 (9 67 (8) 103 (7) 57 | 254 
Q) 10 0 103 ( 113 (13) 117 | 343 
(0 110 (2 190 (14) 207 (10) 297 | 804 


Column totals 19.7 46.3 48.0 59.8 | 173.8 


8 20 (16 50 ©) 40 (10) 137 | 247 
(0 93 (9 17 (4 63 (15 123 | 296 
(12) 167 (7 43 (4 187 (1) 87 | 484 
(13) 167 (2) 300 (п) 257 (8 140 | 864 


Column totals 44.7 41.0 54.7 48.7 | 189.1 
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ment for every row. On the whole, however, the analysis is more com- 
plex than with (k + 1)/2 replications. 

The example is а 4 X 4 lattice square conducted by the U. S. Bureau 
of Entomology and Plant Quarantine at Florence, 5. C., and described 
by Wadley (12.3). The treatments were 16 arsenical insecticides applied 
to cotton with a hand dusting machine. Plots were 10 rows wide and 
70 feet long, being about М в acre. To allow for border effects, records 
were taken only from the 4 center rows. The data in table 12.5 are the 
percentages of squares * showing attack by boll weevils. These figures 
were obtained by examining 100 squares per plot, 25 from each of the 
4 center rows. Such counts were made at intervals during the summer: 
the data are averages from 3 counts made in August. 

The simplest computational routine, devised by Yates (12.1), will be 
presented here, though it is not too well adapted for making clear the 
meaning of the various steps. Yates’s paper should be consulted for an 
account of the theory. 


TABLE 12.6 ORIGINAL AND ADJUSTED TREATMENT TOTALS 


Treat- Adj. 

ment treat- 
totals, ment Adj. 
T Re с, D L J K м’ totals means 
1 246 1813 164.1 17.2 64.3 815 1831 150.3 32.24 645 
2 тат 2844 1966 87.8 —250.8 -16%0 1004 1882 6841 13.68 
з 336 1464 2219 -755 2748 1003 - 27.2 —1027 4304 813 
4 560 2142 2221 — 79 254 175 - 62 - 141 5679 1136 
5 500 2207 231 – 24 = 311 — 335 — 407 — 43.1 4720 944 
6 340 2110 1614 498 - 466 30 1518 2014 37.80 7.58 
7 333 1800 2110 -310 105.6 746 — 184 — 494 3685 737 
8 483 2134 2240 -106 - 14 -— 120 - 438 - 544 46.58 9.32 
8 500 1978 2403 —425 83.4 409 — 86.6 -1291 5007 1001 
10 814 259.3 253.5 58 — 985 — 927 — 768 - 69.5 7457 14.91 
п 980 2951 2525 426 —211.1 -1685 - 407 19 8795 17.59 
12 683 2501 227.6 225 —1049 - 824 - 149 7.6 63.51 12.70 
13 608 2360 2512 -152 - 644 — 798 -1052 —1404 5345 10469 
14 744 265.5 204.4 611 —1575 - 964 86.9 1480 71.6 14.27 
15 433 1787 2365 -578 152.1 оз — 79.1 136.9 4642 928 


422 5546 11.09 


16 417 1557 1994 -437 260.7 217.0 85.9 


Totals 8724 3489.6 3489.6 0 9 0 9 о 87239 17447 


1. Find the row, column, replication, and grand totals, all shown in 
table 12.5. 
2. Find the treatment totals Т (table 12.6). For each treatment find 


В, = total of all rows in which the treatment appears 
C, = total of all columns in which the treatment appears 


*А “square” is the name given to the young flower bud. 
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i 
It may save time to find all 3 quantities, Т, R;, and С,, by simultaneous i 
addition, since this means that the positions of a treatment in the dif- 
ferent replicates are found only once. This can be done if the machine 
carriage is moved so as to accommodate 3 sets of running totals. The | 
sum of the R; values and the sum of the C, values should each equal k 
times the sum of the T' values. 
"Thereafter the successive columns in table 12.6 are filled out as follows. 


D - В, — С, 

Ш =k? — (k- )R, +6 
J=D+L' 
K=J+(k—1)D 

М =D+K 


Note that the total of each of these quantities is zero. 

3. Compute the analysis of variance. The only sums of squares re- 
quiring special instructions are those for rows and columns. After ad- 
justment for treatment effects, it happens unfortunately that the sums 
of squares for rows and columns are not mutually orthogonal. By & 
well-known result in the analysis of variance, their combined sum of 
squares may be expressed in either of two ways: 


Rows (adj. for treatments) + columns (adj. for treatments and rows) 
= columns (adj. for treatments) + rows (adj. for treatments and 
columns) 

Both expressions are computed. 


Sum of squares for rows adjusted for treatments 


= SCL _ (64.3)? + (250.8)? +--+ (260.7 _ 1999.09 
Ek + 1) 320 


Sum of squares for rows adjusted for treatments and columns 


= SU) _ (81.5)? + (163.0)? +- - -+ (217.0)? _ 1026.76 
E —1) 192 


SM = 625.85 
P 10 


Sum of squares for columns adjusted for treatments and rows 


Sum of squares for columns adjusted for treatments = 
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Note that 1093.02 + 559.59 = 1026.76 + 625.85. 


TABLE 127 ANALYSIS OF VARIANCE 


d.f. 8.8. m.s. 
Replications k 4 31.56 7.89 
Treatments ы-і 15 1244.20 82.95 
Rows (adj. for treatments) 0-1 15 1093.02 
Rows (adj. for tr. and col.) 0-1 15 1026.76 68.452, 
Columns (adj. for tr.) 0-1 15 625.85 
Columns (adj. for tr. and rows) k- 15 559.59 37.31Е, 
Error (à — 1)(k — 2) 30 680.17 22.67Е, 
Total B+P-1 7 3608.54 


In finding the error s.s. by subtraction, note that we subtract the com: 
bined effect of rows and columns only once. 
4. This leads to the adjusted treatment totals. Compute 


q = РЕ,Е, — ЕД = (16)(68.45)(37.31) — (22.67)? = 40,348 
Q = (k — 1) = 121,044 
_ (B, — EQ(E,— E) _ (45.78)(126.57) 


Y = 0.04787 
Q 121,044 

‚ _ (Ee — EE, — E) _ (146005113) _ өз 
Q 121,044 


The adjusted treatment totals are 
T+NL' + М 
and are inserted on the right of the М” column ш table 12.6. For the 
means, divide by r. 4 4 
5. The error variance of the difference between two adjusted means 18 


шы [1 +k +) = 2010 [1 + 4(0.07824)] = 11.9 


To obtain an approximate F-test of the treatments, compute the sum 
of squares of deviations of the adjusted treatment totals, and divide by 
(k + 1)(k? — 1) to give the mean square on a single unit basis. This is 
tested against the effective error m.s, ЕДІ + KV +). —— i 

’ The gain in precision relative to a randomized blocks or lattice design 
is estimated by the procedure for the lattice square with (+ 1)/2 
replications, 
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12.23 Designs with 2(k + 1) Replications. This case is unlikely to 
be of much practical interest except possibly with the 3 X 3 design in 
8 replicates and the 4 X 4in 10. The analysis is similar to that for (k + 1) 
replications; the changes will be noted very briefly. 

The partition of degrees of freedom is as follows. 


d.f. 

Replications (2k + 1) 
Treatments (2-1 
Rows 

Component (a) (x? — 1) 

Component (b) (k? —1) 
Columns 

Component (a) (k? — 1) 

Component (0) (k? — 1) 
Error (2k — 3)(k? — 1) 

Total ЗЕЕ — 1 


Corresponding to any row, there is another row with the same set of 
treatments. Find the differences D, between the totals of corresponding 
rows. The sum of squares of deviations of the D, from their replication 
means, divided by 2k, gives component (a) of the rows s.s. Component 
(b) corresponds to the component that was obtained with (k + 1) rep- 
licates. As before, it has two forms, an L/ andaJ form. Both are found 
in the same way as with (k + 1) replications, except that the divisors 
must be doubled. The sums of squares for columns are derived similarly. 

The formulae for the adjustment factors are a little more complicated. 
Let Е, be the mean square found by pooling component (a) and the J 
component of the rows s.s., with an analogous definition for Ее. Cal- 
culate 


1 2k — 1 2k — 1 
Wi; =— W, = ( ) H т. = ( ) 
Е, 2kE, — E, 2kE, — E, 
QV; — W,) Wi — We 


N= ; = 
KW, + We + (Е DW] KW, + We — (k — ПИЯ 
The remainder of the analysis proceeds without change. 


, 


и 


12.24 Missing Data. The procedure for analyzing lattice squares 
when certain observations are missing has been worked out by Cornish 
(12.4). As in the case of lattices (section 10.13), the method requires 
two different estimates of each missing observation, one being that found 
by minimizing the intra-row-and-column error, the other being that ap- 
propriate to the case where the data are analyzed as a randomized block 
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experiment. In order to reduce the amount of computation, we present 
a cruder method which should be satisfactory if only a small fraction of 
the total observations is missing. 

In this approach we estimate each missing observation by the first 
method mentioned above, that is, by minimizing the intra-row-and- 
column error s.s. Thereafter the analysis proceeds as usual: 1 d.f. is 
deleted from the total and intra-row-and-column error 8.8. for each miss- 
ing value. Special rules for t-tests are given later. 

The following totals are used in the formulae for inserting estimates 
in place of the missing values. R, C, T, and P are the totals of the row, 
column, treatment, and replication, respectively, that contain the miss- 
ing observation, while @ is the grand total. Further, 


8, = total of all other rows and columns in which the treatment with 
the missing value appears (note that this sum does not in- 
clude the row and column which contain the missing value) 

', = total (from all replicates) of all other treatments which appear 
in the row or column that contains the missing value 
Z = kR, — Py + С. — Po 
With (k + 1) replicates, there is one other row with exactly the same 
treatments as C. Let Ві be its total and Р; the total of the replicate 
containing №, with similar definitions for Сз and Р». With (k + 1)/2 
replicates, Z is not used. 

Experiment with (Е + 1)/2 replications 

k(k — 1)(R + C) — (k + 3)P + 2k(k — 2)T + 6G — 255, — 2kT, 
(k — 1)(Е — 3) 

Experiment with (Е + 1) replications 
же k(k — 1)(R + C) — (k+ DP + klk — 2)Т -- 38 — #8. — К. + 2 
(k — D(k — 2) 

Experiment with less than (k + 1)/2 replications. Because of the lack 
of symmetry in this design, an additional symbol is needed. 

U, = total (from all replicates) of all treatments that appear in the 
same row or column as the treatment that has the missing 
value. 

Тһе distinction between Uz and Т, should be realized. T, is a total 
over only those other treatments that are in the actual row or column 
that contains the missing value, while Uz is a total over all other treat- 
ments that appear somewhere in the experiment in the same row or 

alue. Thus Т, is а part 


column as the treatment that has the missing V: 
of U,. It may help to note that T; ва total over 2(k — 1) treatments, 
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ог 2r(k — 1) observations, where т is the number of replications, while 
Uz is a total over 2r(k — 1) treatments, or 2r?(k — 1) observations. 


Ir — 1(R + C) — r(r + DP + k(k — 2)(r — “| 
+ (r+ 1G — 08, — ыТ, + Ы), 
(Е— 1—1) —k—r—1) 


Example. In the corn experiment in table 12.3, suppose that treat- 
ment (18) in replication I is missing. Неге k = 5, г = (k + 1)/2 = 3, 
Omitting the observation 33.3 for this treatment, we find 

Е = 125,8; С = 119.9; R +C = 245.1; Р = 721.9 
Т = 56.5; G = 21561 

S is the total of the rows and columns in which treatment (18) appears 

in squares II and ПТ. Thus 


5; = 165.1 + 145.3 + 109.8 + 126.0 = 546.2 


Finally, Т, is the total for treatments (2), (9), (11), and (25), which ap- 
pear in the row that has the missing value, plus that for treatments (5), 
(6), (12), and (24) which appear in the column with the missing value. 
These totals are to be taken from the table of treatment totals. This 
gives Т, = 709.2. Then 


Ape — (8)(721.9) + (30) (56.5) | 
-1 + Gyms) - 0006462) - 00/092 
32 


The exact formulae for t-tests are complicated, and it appears that any 
good approximate rule would also be rather complicated. The following 
tule is cuggested, though it somewhat underestimates the standard 
errors for the smaller squares and also lacks the simplicity that might be 
desired. For a comparison between the means of two treatments A and 
B, we assign to each an effective amount of replication which depends on 
the number and situation of the missing values. When scoring A we 
examine each replicate in turn and assign a score by the following rules. 


= 38.0 


Case Score to A 
I. A missing 0 
IL. A present, ard В in the same row or column as А 
i. B missing 0 
ii. B present 1 
III. A present, and В not in the same row or column as А 
1. Other values missing in both row and column X 
ii, Other values missing in row or column but not in 
both х 


iii. No values missing in row or column 1 
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The effective replication for A is of course its total score over all rep- 
lications in the experiment. В is scored similarly. Suppose that in the 
example the mean of treatment (18) were being compared with that of 
treatment (9), which occurs in the same row in square I. The effective 
replication for treatment (18) is 2, since case I arises in square I. The 
effective replication for treatment (9) is also 2, since case II(i) arises in 
square I. 
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PLANS 
Plan 12.1 3 X 3 balanced lattice square 
t = 9,Е = 3, r = 2, rows = 6, columns = 6, à = 1* 
Square I Square II 
123 168 
456 924 
7-8-0 573 
Plan 12.2 4 X 4 balanced lattice square 
t=16,k = 4, r = 5, rows = 20, columns = 20, à = 2 
Square I Square II Square III 
› 5 HH 8 1-52 "BL € 1 1-16 6 
2 6 10 14 6-578 1 22-5 15 
3 7 11 15 11 12 9 10 4 839 
4 8 12 16 16 15 14 13 7 18 10 4 
Square IV Square V 
тти и 1 10 15 8 
8 21311 9 2 7 16 
10 16 3 5 13 6 312 
15 9 6 4 5 14 11 4 


* Number of times that two treatments appear in the same row or column. 
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Plan 12.3 5 X 5 balanced lattice square 
t = 25, Е =5,r = 3, rows = 15, columns = 15, А = 1 
Square I Square II Square III 
1.9 9 4473 1 10 14 18 22 1 815 17 2 
6 B. жәй 28° 2 6 15 19 25 2 9 1l 18 
11 12 13 14 15 2024 3 ТІ 19 21 3 10 12 
16 17 18 19 20 12 16 25 4 8 13 20 22 4 6 
21 22 23 24 25 9 131721 5 7 14 16 23 5 
Plan 12.4 7 X 7 balanced lattice square 
t= 49, k = 7, r = 4, rows = 28, columns = 28, А = 1 
Square I Square II 
D Bg 2598.7. 1 38 26 14 44 32 20 
8. 9 10 11.12 18, 4 21 2 39 27 8 45 33 
15 16 17 18 19 20 21 34 15 3 40 28 9 46 
22 23 24 25 26 27 28 47 35 16 4 41 22 10 
29 30 31 32 33 34 35 11 48 29 17 5 42 2 
36 37 38 39 40 41 42 24 12 49 30 18 6 36 
43 44 45 46 47 48 49 37 25 13 43 31 19 7 
Square Ш Square IV 
1 19 30 48 10 28 39 1 42 27 12 46 31 16 
40 2 2 31 49 11 22 17 2 36 28 13 47 32 
23 41 3 21 32 43 12 33 18 3 37 22 14 48 
13 24 42 4 15 33 44 49 34 19 4 38 23 8 
45 14 25 36 5 16 34 9 43 35 20 5 39 24 
35 46 8 26 37 6 17 25 10 44 29 21 6 40 
3 29 47 9 27 38 7 41 26 11 45 30 15 7 


Plan 12.6 


PLANS 


8 X 8 balanced lattice square 
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t = 64, k = 8,7 = 9, rows = 72, columns = 72, à = 2 


Square I 


Square IT 


1 917 25 33 41 49 57 


1 10 19 28 37 46 55 64 


Square III 
1 44 62 56 27 39 18 13 


2 10 18 26 34 42 50 58 


9 251 44 61 30 23 40 


46 217 35 16 53 60 31 


311 19 27 35 43 БІ 59 


17 50 3 36 29 62 15 48 


64 23 3 25 54 12 45 34 


4 12 20 28 36 44 52 60 


25 42 35 4 21 14 63 56 


55 40 29 4 58 41 11 22 


5 13 21 29 37 45 53 61 


33 58 27 20 5 54 47 16 


28 9 50 63 5 24 38 43 


6 14 22 30 38 46 54 62 


41 26 59 12 53 6 39 24 


37 51 15 42 20 6 32 57 


7 15 23 31 39 47 55 63 


49 18 11 60 45 38 7 32 


19 61 48 14 33 26 7 52 


8 16 24 32 40 48 56 64 


57 34 43 52 13 2231 8 


10 30 36 21 47 59 49 8 


Square IV 


Square V 


Square VI 


1 60 54 40 43 15 26 21 


1 11 20 30 34 48 53 68 


1 59 52 38 42 16 29 23 


62 22511 24 37 52 47 


15 256 45 59 28 22 33 


63 232 13 19 36 54 41 


56 31 3 41 38 20 61 10 


2152 3 39 32 58 946 


53 28 3 47 40 18 57 14 


39 16 45 4 50 57 19 30 


26 47 38 4 17 13 64 51 


34 15 46 4 49 61 24 27 


44 17 34 55 5 32 14 59 


40 62 31 19 549 42 12 


48 22 39 51 5 25 10 60 


13 35 23 58 28 6 48 49 


43 25 61 16 55 6 36 18 


11 33 21 64 31 6 44 50 


27 53 64 22 942 7 36 


54 24 10 57 44 35 729 


30 56 58 17 12 43 7 37 


18 46 12 29 63 5133 8 


60 37 41 50 14 23 27 8 


20 45 9 26 62 55 35 8 


Square УП 


Square ҮШІ 


Square IX 


1 32 47 61 22 50 12 35 


1 14 24 31 36 45 БІ 58 


ет 8 


29 214 49 39 64 43 20 


12 255 48 57 27 21 38 


14 12 16 1015 9 13 11 


42 13 324 60 33 30 55 


22 49 3 37 26 63 16 44 


24 21 22 23 18 19 20 17 


59 5418 4483137 9 


32 43 33 4 23 10 62 53 


31 27 26 32 30 29 25 28 


23 36 57 46 5 11 56 26 


35 64 30 18 552 41 15 


36 38 37 33 35 34 40 39 


52 63 40 27 10 6 17 45 


27 29 60 956 63419 


45 48 44 43 41 47 46 42 


16 41 28 34 БІ 21 762 


50 20 13 59 4640 725 


38 19 53 15 25 44 58 8 


61 39 42 54 11 17 28 8 


51 55 49 58 52 5650 54 
58 57 63 62 64 60 59 61 


8 


Ріпп 18.6 


11 


ЕЕЕ ЕЕ 
Б ЕЕ 


e 


5 


12 


Square I 
I ACT ме, EO ы 9 


9 X 9 balanced lattice square 


t = 81, k = 9, = 5, rows = 45, columns = 45, \ = 1 
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Square II 


45 53 


13 


14 


16 


17 
26 
35 


35 
52 


44 


82 
а 


ElSISIEISISINIS 


Sais 


Plan 18.7 


PLANS 
11 X 11 balanced lattice square 
t = 121, k = 11, т = б, rows = 66, columns = 66, Х = 1 


Square I 

1 9 8 54—55 Gt 88 59 230 11 
12 13 14 15 16 17 18 9 20 21 22 
23 24 25 2% 27 928 79 90 31 32 93 
34 35 36 37 38 39 40 41 42 43 44 
45 46 47 48 49 50 bi 52 53 54 55 
56 57 58 59 60 61 62 63 64 65 
67 68 69 70 71 72 73 74 75 76 
78 79 80 81 82 83 84 85 86 87 
89 90 91 92 93 94 95 96 97 98 99 
100 101 102 108 104 105 106 107 108 109 110 
Til 112 113 114 115 116 117 118 119 120 121 


-i o2 в? 62 4 22 12 92 72 52 9» 
33 2 109 83 63 43 12 113 98 73 59 
54 з 3 104 84 64 44 13 114 м 74 
75 55 24 4 105 85 05 34 14 115 95 
96 70 4 25 5 106 86 66 95 15 16 
П7 97 77 46 26 6 107 87 5 96 16 
17 ив 98 67 47 27 7 108 88 57 97 
38 18 10 99 68 48 28 8 109 78 58 
50 39 19 10 89 69 49 29 9 110 79 
80 60 40 20 121 90 70 9 30 10 100 
10  ачашяйы за и 


плге TIT 
iis 108 90 88 76 58 51 M 
2 116 109 9) м 7 9 52 
з 117 110 92 85 67 00 
22 а 18 100 93 86 68 
94 87 
95 


| 


8 
м 


3 
sie 


"T 
EE 
= 
EIE 
EIE 
ёт 
as 
a| 
ЕЛ 
= 
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11 X 11 balanced lattice square 


Square IV 
1 40 68 107 14 53 81 120 27 66 94 
95 2 41 69 108 15 54 82 121 28 56 
57 96 3 42 70 109 16 5 83 111 2 
30 58 97 4 43 71 110 17 45 84 112 
113 31 59 98 5 44 72 100 18 46 85 
86 114 32 60 99 684.93. 101. 19 47 
48 87 115 33 61 89 7 35 74 102 20 
21 49 88 16 23 62 8 36 75 103 
104 22 50 78 17 24 68 91 9 37 76 
77 105 12 51 79 118 25 64 92 10 38 
39 67 106 13 52 80 119 26 65 93 11 

Square V 
1 119 105 91 88 74 60 46 43 2 15 
16 2 190 106 92 78 75 61 47 44 30 
81—17. 3121-107. 08 70.— 70 62 48 34 
35 32 18 4 1H 108 94 80 77 63 49 
50 36 33 19 5 112 109 95 81 67 64 
65 51 37 23 2 6 113 10 96 82 68 
69 66 52 38 24 21 7 114 100 97 83 
84 70 56 53 39 25 22 8 115 101 98 
99 85 71 57 54 40 26 12 9 116 102 
103 89 86 72 58 55 41 27 13 10 117 
118 104 90 87 73 59 45 42 28 14 1l 

Square VI 
1 110 87 64 41 18 116 93 70 47 24 
25 2 100 88 65 42 19 117 94 71 48 
49 26 3 101 78 66 43 2 118 95 72 
73 50 27 4 102 79 56 44 21 119 9% 
97 74 51 28 5 103 80 57 34 22 120 
121 98 75 52 2 6 104 81 58 35 12 
13 11 99 76 58 30 7 105 82 59 36 
37 14 112 89 77 54 31 8 106 83 60 
61 38 15 13 90 67 55 32 9 107 84 
8 62 39 16 114 91 68 45 33 10 108 
109 86 63 40 17 15 92 69 46 23 11 
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Plan 12.8 13 X 18 balanced lattice square 
4 = 169, Ё = 13, r = 7, rows = 91, columns = 91, à = 1 
Square I 
1 "3 78 ХХ Spe Oey БИЛИНДЕР Sito ay НЗ 
и 15 16 718 9 20 21 22 23 и 25 26 
от 28 29 30 31 32 33 34 35 36 37 38 39 
40 41 42 43 4 45 46 47 48 49 50 51 52 
53 54 55 56 57 58 59 60 61 62 63 64 65 
66 67 68 69 70 71 72 73 74 75 76 77 78 
79 80 8 82 82 84 85 86 87 88 89 90 91 
92 93 04 95 96 97 98 99 100 101 102 103 104 
105 106 107 108 109 110 111 112 113 14 115 116 117 
118 119 120 121 122 123 124 125 126 127 128 129 130 
131 132 133 134 135 186 137 138 139 140 141 142 143 
144 145 146 147 148 149 150 151 152 153 154 155 156 
157 158 159 160 161 162 163 164 165 166 167 168 169 
Square II 
1 26 38 50 62 74 86 98 110 122 134 146 158 
159 2 м 39 51 63 75 87 99 111 123 135 147 
148 160 3 15 27 52 64 76 88 100 112 124 1% 
137 i49 161 4 16 28 40 65 77 89 101 113 125 
126 138 150 162 5 17 29 41 53 78 90 102 114 
115 127 139 151 163 6 18 30 42 54 66 91 103 
104 116 128 140 152 44 7 19 31 43 55 67 79 
30 92 117 129 141 153 15 8 20 32 44 56 68 
69 81 93 105 130 142 154 16 9 21 33 4 57 
58 70 82 94 106 118 143 155 167 10 22 34 46 
47 59 Тїї 82 95 107 119 131 156 108 11 2 55 
35 48 60 72 84 96 108 120 132 144 169 12 24 
25 37 49 6l 73 85 97 109 121 133 145 157 13 
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Plan 12.8 (Continued) 13 X 13 balanced lattice square 
Square Ш 
1 24 34 44 54 77 87 97 107 130 140 150 160 
11 2 25 35 45 55 78 88 98 108 118 141 151 
152 162 3 2 36 46 56 66 89 99 109 119 142 
143 153 163 4 14 37 47 57 67 90 100 110 120 
121 131 154 164 5 15 38 48 58 68 91 101 111 
112 122 132 155 165 6 16 39 49 59 69 79 102 
103 113 123 133 156 166 7 17 27 50 60 70 80 
81 104 114 124 134 144 167 8 18 28 51 61 71 
72 82 92 115 125 135 145 168 9 19 29 52 62 
63 73 83 93 116 126 136 146 169 10 20 30 40 
41 64 74 84 94 117 17 137 147 157 п 21 31 
32 42 65 75 85 95 105 128 138 148 158 12 22 
23 33 43 53 76 86 96 106 129 139 149 159 13 
Square ТУ 
1 22 30 51 59 67 88 96 117 125 133 154 162 
1603 2 23 31 52 60 68 89 97 105 126 134 155 
156 14 3 24 32 40 61 69 90 98 106 127 135 
136 144 165 4 25 33 41 62 70 91 99 107 128 
120 137 145 166 5 2 34 42 63 71 79 100 108 
109 130 138 146 167 6 14 35 43 64 72 80 101 
102 110 18 139 147 168 7 15 36 44 65 73 8 
82 103 11 119 140 148 169 8 16 37 45 53 74 
75 83 14 112 120 141 149 157 9 17 38 46 54 
55 76 84 92 113 121 142 150 158 10 18 39 47 
48 56 77 85 93 114 122 143 151 159 11 19 27 
28 49 57 78 86 94 115 123 ІЗІ 152 160 12 20 
21 29 50 58 66 87 95 16 124 132 153 161 13 


Plan 12.8 (Continued) 


1 


20 39 


PLANS 


13 X 13 balanced lattice square 
Square V 


45 


70 


95 114 


120 139 


165 


2 21 


27 


90 96 


115 121 


147 


166 3 


22 


47 


72 91 


97 116 


142 


148 167 


79 98 


124 


143 149 


106 


125 131 


101 


107 126 


78 
55 
31 50 
82 


83 


102 108 


8 14 


78 


84 103 


159 9 


60 


66 85 


129 


154 160 


10 16 


42 


61 67 


92 


111 


136 155 


11 11 


37 
19 


43 62 


68 


87 


93 


112 


118 137 


156 162 


88 44 63 б) 88 94 113 119 138 144 163 13 


1 


Square VI 


18 35 


52 


73 90 94 111 128 132 149 166 


167 


2 19 


36 


57 14 9 


95 112 129 133 150 


151 


168 3 


20 


41 


58 


75 79 


96 113 


130 134 


135 


152 169 


4 


119 136 153 157 
120 137 154 158 
100 117 121 138 155 159 
84 101 105 122 139 156 160 
68 85 102 106 123 140 144 161 9 
69 86 103 107 124 141 145 162 10 14 
53 70 87 104 108 125 М2 146 163 11 
88 92 109 126 143 147 164 12 16 


116 


65 
49 


33 50 54 ТІ 


17 


88 42 59 76 
5 2 39 
6 23 27 4 61 78 
7 24 28 45 62 


48 60 


8 25 


80 97 
77 8 


29 46 
26 30 


114 118 
98 115 
82 99 
66 83 
63 67 
47 64 
31 48 
15 32 


34 51 55 72 89 93 110 127 131 148 165 13 
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Plan 12.8 (Continued) 
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13 X 13 balanced lattice square 
Square VII 


1 16 31 46 61 76 91 93 108 123 138 153 168 
169 2 17 32 47 62 77 79 94 109 124 139 154 
155 157 3 18 33 48 63 78 80 95 110 125 140 
141 156 18 4 19 34 49 64 66 81 96 11 126 
127 142 14 159 5 20 35 50 65 67 82 97 112 
113 128 143 М5 160 6 21 36 51 53 68 82 98 

99 114 129 131 146 161 7 22 37 52 54 69 84 
85 100 115 130 132 147 12 8 23 38 40 55 70 
71 86 101 116 118 133 148 163 9 24 39 41 56 
57 72 87 102 117 119 134 149 164 10 25 27 42 
43 58 73 88 103 105 120 135 150 165 11 26 2 
29 4 59 74 89 104 106 121 136 151 166 12 14 
15 30 45 60 107 122 137 152 167 


CHAPTER 13 


Incomplete Latin Squares 


13.1 Description 


13.11 Youden Squares. These designs, which are constructed by a 
rearrangement of certain of the balanced incomplete blocks, possess the 
characteristie *double control" of the latin square, without the restric- 
tion that the number of replicates must equal the number of treatments. 
A latin square with 13 treatments is rarely used, because it necessitates 
13 replications. There are, however, incomplete latin squares for 13 
treatments in either 4 or 9 replicates. Every treatment occurs once in 
a column (replication), and every pair of treatments appears together 
an equal number of times in the same block. Column differences are 
eliminated automatically from the treatment comparisons, while block 
differences may be removed by adjusting the treatment mean yields in 
the same way as with balanced-incomplete blocks. Most of the designs 
were developed by Youden (13.1, 13.2), whose name is commonly asso- 
ciated with them; previously Yates (13.3) had drawn attention to the 
group of designs in which the number of replicates is one less than the 
number of treatments. 

Plans are presented for three series of designs. The first are the in- 
complete latin squares themselves, formed by omitting certain columns 
in an ordinary latin square. For extra replication, the basic design may 
be repeated. A second series is constructed by adding an incomplete 
latin square to a complete latin square. This combination is sometimes 
useful when the number of treatments is small. Recently, a third series, 
called extended Youden squares (13.1a), has been developed by putting 
together two or more parts of latin squares in such a way that the 
whole design has the balanced incomplete blocks property, although the 
individual parts do not. Table 13.2 at the end of this chapter gives an 
index to the designs for numbers of treatments up to 91. 
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Some further possibilities are discussed briefly without giving detailed 
plans, These include a latin square with a row and a column added 
(section 13.342), chain block designs with two-way elimination of hetero- 
geneity (section 13.35a), and partially balanced designs (section 13.4а). 

An interesting application of the incomplete latin square for the con- 
trol of plant variability was made by Youden (13.1) in greenhouse ex- 
periments on tobacco-mosaic virus. The experimental unit was a single 
leaf, and the data consisted of the number of lesions produced per leaf 
by rubbing the leaf with a solution which contained the virus. The 
numbers of lesions had been found to depend much more on inherent 
qualities of the plant than on the position of the plant on the greenhouse 
bench. Consequently, each block of the design was a single plant, so 
that the large differences in responsiveness which existed among plants 
did not contribute to the experimental errors. The columns were the 
positions, from top to bottom, of the five leaves which were used on each 
plant. That is, the first replication contained the top leaf of every 
plant. Since there was a fairly consistent gradient in responsiveness 
down each plant, this control also proved effective. 

As in the example above, when laying out an incomplete latin square 
the general principle is to group the units so that differences among 
blocks and differences among columns represent the major sources of 
variation that are known or suspected, 


13.12 Randomization. In plans 13.1 to 13.26 and plans 13.1а to 13.5a, 
the steps are (1) rearrange the blocks of the plan at random, and (2) re- 
arrange the replications of the plan at random. 

When a plan is repeated for additional replication, blocks and columns 
are randomized separately within each repetition. The repetitions may 
be kept separate. 

In plans 13.6a to 13.17a, where the plan consists of two or more in- 
complete latin squares, randomize the blocks and columns separately 
within each incomplete square. 


13.2 Statistical Analysis 


Tn experiments with more than 10 treatments, the numbers of degrees 
of freedom for blocks'are large enough to allow the use of inter-block in- 
formation. The appropriate analysis is described in the next section. 
For smal! experiments, where inter-block information should be ignored, 
the analysis is given in section 13.22. The recommended method of 
analysis for any experiment is indicated both in table 13.2 and in the 
plans by the “type.” 
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13.21 Type I. Analysis with Recovery of Inter-block Information. 
The steps are as follows (¢ = number of treatments, К = number of units 
per block = r). 

1. Calculate the treatment totals Т, the column totals, the block 
totals, and the grand total С. 

2. For each treatment, calculate the total В, of all the blocks which 
contain the treatment. Place these quantities in a column next to the 
column of treatment totals. Then form a third column of the quantities: 


W = (t — ET — (t — 1)В, + (k — 1)6 
The W’s should sum to zero. 


3. The separation of degrees of freedom in the analysis of variance is 
as follows. 


d.f. 11.8. 
Columns (replications) (k — 1) 
Blocks (adjusted) ((—1) E, 
Treatments (unadjusted) (4-1) 
Error (k — 2)(t — 1) Е, 
Total (tk - 1) 


The total s.s. and the sums of squares for columns and treatments are 
found by the usual methods. The sum of squares for blocks (adjusted for 
treatments) is the sum of squares of the W's, divided by kt(t — k)(k — 1). 
The error s.s. is obtained by subtraction. 

4. Calculate the factor 

(Ey — Е.) 

ЦЕ — ПЕь 

The adjusted total for any treatment is 

Ү= Т+ и 

Should Е be less than Ee, и is taken as zero and no adjustments are 
made to the treatment totals. The adjusted treatment means are ob- 
tained on dividing each Y by т. The estimated error variance of the dif- 
ference between two adjusted treatment means 1s 


2Е, 
т 


и 


(i+ (@— kul 


13.21а Type У. Analysis with Recovery of Inter-block Information. 
These are the new designs formed by putting several parts of squares 
together so that the whole design is balanced. The analysis proceeds as 
in section 13.21 except for the changes noted below. 
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3. The separation of degrees of freedom in the analysis of variance is 
as follows: 


d.f. m.s. 
Columns (replications) t= 
Blocks (adjusted) (b — 1) E, 
Treatments (unadjusted) (4-1) 
Егтог ((—30-2:—0—1). E 
Total (tir — 1) 


The sum of squares for blocks (adjusted) is obtained by a round- 
about method. 


Blocks (adj.) — Blocks (unadj.) 
+ Treatments (adj.) — Treatments (unadj.) 


The sums of squares for blocks (unadj.) and treatments (unadj.) are 
computed in the usual way. To find the sum of squares for treatments 
(adj), compute for each treatment the quantity 

О =ЕТ- В, 
Then 
1 
Treatments (adj.) = am Уо? 

4. The weighting factor is 
И (-1(%- E) 

КЕ — 1)(b — 1)Es + (t — k)(b — дЕ, 

This completes the necessary changes. Note: if it is desired to analyze 
any type V design without recovery of inter-block information, follow the 


method in section 13.22, except that the partition of degrees of freedom 
is as given in this section. 


в 


13.22 Туре П. Analysis without Recovery of Inter-block Information. 
The analysis proceeds as follows: 

1. Calculate the column totals, the block totals, the treatment totals 
Т, and the grand total б. 

2. For each treatment, obtain the quantity 


Q = ЕТ — В 
where В, is the total of all the blocks in which the treatment appears. 
The quantities Q should sum to zero. 


3. The degrees of freedom in the analysis of variance are partitioned 
as in section 13.21. (For plans 13.6a and 13.7а, use the partition of de- 
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grees of freedom given in section 13.21a.) In this case the blocks s.s. is 
calculated without eliminating treatment effects, while the treatments 
s.s. is adjusted for block differences. The total s.s. and the sums of 
squares for blocks are computed by the standard procedure. The treat- 
ments s.s. (adjusted) is the sum of the squares of the Q’s, divided by (kA. 
The error s.s. is found by subtraction. 

4. In order to obtain any treatment mean, adjusted for block dif- 
ferences, divide the corresponding Q by А and add to the quotient the 
mean for the whole experiment. В 

The estimated error variance of the difference between two adjusted 


treatment means is 
2E, k(t — 1) 


“т k=) 

Numerical examples of this analysis for { = 7, г = 3, and ¢ = 21, 
r = 5 are given in reference (13.1), and an example for ¢ = 6, r = 5 in 
(18.8). 


13.23 Туре Та. Repetitions of Type I. Let л be the number of repli- 
cations in the basic plan, which is used p times іп an experiment, so that 
the total number of replications r = np. 

The analysis presented here may be used with repetitions of any of the 
type I and П designs, since even with small numbers of treatments the 
repetition provides sufficient degrees of freedom for estimating the inter- 
block variation. If, however, the efficiency factor exceeds 95%, it is 
searcely worth while to utilize inter-block information. 

The changes necessary in the procedure of section 13.21 are outlined 
below, the numbers referring to the steps in that section. 

1. Unchanged. Corresponding to any block, there are (p — 1) other 
blocks which contain the same set of treatments. The block totals 
should be arranged in a table (table A, say) with ¢ rows and p columns. 

2. Unchanged. Note that the calculation of the quantities В, is ex- 
pedited by the use of the row totals of table A. 

3. The analysis of variance is as follows. 


d.f. m.8. 

Columns (replications) (kp — 1) 
Blocks 

Component (a) (n—-1(-1) 

Component (b) (4-1) 
Block total p(t — 1) By 
Treatments (4-1) 
Error (—ПФЕ-Р-П E. 


Total (pkt — 1) 
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In this analysis, the (p — 1) degrees of freedom among repetitions 
have been ascribed to the columns, so that the blocks s.s. is actually a 
sum of squares among blocks within repetitions. 

Component (a) of the sum of squares for blocks consists of comparisons 
among blocks which contain the same set of treatments and is the inter- 
action s.s. for table A. 

Component (5) is the sum of squares of the W’s, with a divisor 
pkt(t — k)(k — 1). 

4. Unchanged except that 


ae! c Бақа 
Ж РКК — 1)E, — (4- k)(p — ПЕ. 


The estimated error variance of the difference between two adjusted 
ireatment means is : 


2E, 
= [1 + (t — n)a] 


13.24 Type Ila. Repetitions of Type II. As we have pointed out, 
in designs where the efficiency factor is high, inter-block information 
can be neglected. The changes required from section 13.22 should 
present no difficulty. The degrees of freedom subdivide as in section 
13.23. The p(t — 1) degrees of freedom among blocks within repetitions 
are calculated in the usual way, each repetition furnishing (¢ — 1) de- 
grees of freedom. Notice that the divisors for the treatments s.s. and 
for the quantities Q must be increased by the factor p. 


13.25 Missing Data. The formula for inserting an estimate in place 
of a missing observation is obtained by minimizing the intra-block error. 
This is the correct estimate for the smaller experiments where inter- 
block information is ignored. For reasons given in section 10.13, this 
estimate will also be used to provide an approximate solution when 
inter-block information is recovered. The formula is quite similar to 
that for balanced incomplete blocks, with the addition of an extra term 
involving the column (or replication) total. 

Let C, B, and T be the totals of the column, block, and treatment that 
contain the missing value, and let G be the grand total. Further, as in 
the statistical analysis, let B, be the total of all blocks in which the 
treatment with the missing value appears (there will, of course, be r such 
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blocks). Finally, 
Т” = total (over all replicates) of all other treatments that appear in 
the block which has the missing value 


В! = total of the B, values for all other treatments that appear іп the 
block with the missing value 


The estimate z of the missing value is 


г МОЕ ШЕ а ъв, ЗВ 
r(r — 1)(r — 2) 


The symbol А is given for each design in table 13.2. Care must be 
taken not to confuse the block totals with the column totals. 
If the experiment contains p repetitions, the formula is 


‚_ РӘС + РАВ + k(k ОТ — РЕ — kT" — (k — 1)B: + ВИ 
r(k — 1)(pk — p — 1) 


where R is the total of the repetition in which the missing value occurs, 
and А is as in table 13.2. 


43.3 Other Designs for Small Numbers of Treatments 


13.31 Description. When the number of treatments is small, it is 
sometimes useful to have a design of the “Jatin square” type in which 
the number of replicates exceeds the number of treatments. Some de- 
signs can be obtained by repetition of an ordinary latin square or of a 
Youden square. Additional plans can be constructed by adding a You- 
den square to a latin square. 

Plans are available for most numbers of replicates up to 10. Actually, 
plans can be made for any number of replicates up to 10; those shown 
here have been selected for ease of analysis. 


13.32 Type Ш. Statistical Analysis When k = it — 1. In this case 
the size of block, which equals the number of replicates, is 1 less than 
some multiple (2) of the number of treatments. Since these designs have 
not been discussed in the literature, an example is given from uniformity 
data, Table 13.1 shows the tomato yields in pounds of 28 single-row 
plots, Hartman and Stair's data (13.4), on which the design for 4 treat- 


ments in 7 replicates has been superimposed. 
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TABLE 13.1 INCOMPLETE LATIN SQUARE FOR 4 TREATMENTS IN 7 REPLICATES 


Treatment symbols and yields of tomatoes (pounds) 


Column 
Block I II IH IV V VI ҮП Total 
1 2 2 4 4 Б] 3 1 
50 72 83 82 76 89 74 526 
2 1 3 3 1 4 2 4 
40 59 71 91 59 73 52 445 
3 4 1 2 3 1 4 2 
43 57 58 98 54 71 51 432 
4 Б] 4 1 2 2 1 3 
48 54 74 97 75 75 54 477 
Totals 181 242 286 368 264 308 231 1880 


Adjusted means 


Block Т. О = 07 +В 0 + 48 (subtract 11.2) 
1 465 3,781 78.8 67.6 
476 3,777 78.7 67.5 
3 495 3,897 812 70.0 
4 444 3,585 74.7 63.5 
Check totals or mean 1,880 15,040 67.2 


Since the plots measured 6 feet X 24 feet, each column (replication) 
is compact, being 24 feet square. There are obviously substantial dif- 
ferences among replicates. The blocks, although very oblong, may also 
exhibit differences in yield, because each block is a separate row of plants. 

The efficiency factors of these designs are all very high, so that inter- 
block information is ignored in the analysis. 

The steps in the analysis are as follows: 

1. Calculate the column totals, the block totals, the treatment totals 
Т, and the grand total G. 

2. A property of these designs is that each treatment is replicated less 
in one block than in the other blocks, Thus, in the example, treatment 
(1) appears only once in the first block, but twice in all other blocks. 
Similarly, treatment (2) is deficient in block 2 and soon. For each treat- 
ment calculate the quantity 


Q=kT+B 


where B’ is the total for the block in which the treatment is deficient. 
For instance 
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Qi = 7 X 465 + 526 = 3781 


As a check, the quantities Q should sum to (k + 1)G. 

3. All sums of squares in the analysis of variance are obtained in the 
usual way except that for treatments, which is given by the sum of 
squares of deviations of the Q's, divided by k(k? — 1). In this case we 
have 

(3781)? + (3777)? + (3897)? + (3585)? — $(15,040)? 
336 


The analysis of variance is shown below. The degrees of freedom sub- 
divide as in section 13.21. 


149 


d.f. 8.8. m.s. 

Columns (replications) 6 5387 897.8 

Blocks 3 750 250.0 

Treatments 3 149 50.0 

Error 15 767 51.1 
Total 27 7053 


The elimination of block differences has substantially reduced the 
error 0.8. 

4. To obtain the adjusted treatment mean yields, we first divide each 
Q by (k? — 1), in this case 48. From the resulting quantities we sub- 
tract G/tk(k — 1), or 1880/168 = 11.2. As a further check, the mean 
of the adjusted treatment means should equal the mean yield of the 
whole experiment (67.2). The efficiency factor is (k? — 1)/k?, and the 
estimated error variance of the difference between two adjusted treat- 


ment means is 
2Ет 


P-1 


13.33 Туре IV. Statistical Analysis When k = й +1. The analysis 
closely resembles that of section 13.32. For each treatment there is one 
block in which the treatment has extra replication. ‘Thus, in plan 13.17 
for { = 3, т = 7, treatment (1) appears 3 times in the first block but only 
twice in any other block. The only changes in the computing instruc- 
tions of section 13.32 are (i) 0 = kT — В’, where В” is the total of the 
block in which the treatment has extra replication, (ii) the Q's sum to 
(k — 1)G, and (iii) for the adjusted treatment means, divide Q by 
(k? — 1), and add to the quotient the quantity G/tk(k + 1). The change 
in the divisor of G should be noted. 1 

The error variance of the difference between two adjusted means re- 
mains as at the end of section 13.32 since k = т for these plans. 
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13.34a Other Modifications of the Latin Square. There are other 
modifications of the latin square for which the analysis is slightly more 
complicated. These designs may occasionally supply a need in situations 
where variations in two directions at right angles are encountered. 
Latin square with а row and column added. Table 13.1a shows an ex- 
ample (not randomized) with 4 treatments. Treatments (1), (2), and (3) 
are replicated 6 times, but treatment (4) has to be replicated 7 times, 


TABLE 13.la A 4 X 4 LATIN SQUARE WITH A ROW AND COLUMN ADDED 


128341 d.f. 
2324112 Rows 4 
84123 Columns 4 
41234 Treatments 3 
12344 Error 13 

Total 24 


Latin square with а column added and а row omitted. In this design for 
5 treatments in a 6 X 4 rectangular array, the first 4 of the treatments 
have 5 replicates each; however treatment (5) has only 4 replicates, 


TABLE 13.2a A 5 X 5 LATIN SQUARE WITH A COLUMN ADDED AND A ROW 


OMITTED 
123451 d.f. 
2,8; 4.5.1.3 Rows 3 
345128 Columns 5 
4512834 Treatments 4 

Error 11 
Total 23 


Latin square with а тош and column omitted. This plan accommodates 
6 treatments in a 5 X 5 square array. One treatment, in this case (5), 


TABLE 13.38 А 6 X 6 LATIN SQUARE WITH A ROW AND COLUMN OMITTED 


12345 df. 
23456 Rows 4 
3. 4.5.6 1 Columns 4 
4 D 5 l1 2 Trestments 5 
ONCE NC 8 Error 11 


Total 24 
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has one extra replicate. The statistical analysis for the first two designs 
above is given by Pearce (13.2a), who proposed them for certain situa- 
tions in horticultural research. For example, the trees available for an 
experiment may lie in a 6 X 4 rectangular array in which there is a 
likelihood of gradients along both rows and columns. A new experiment 
may be started on a site which previously held an old experiment in 
randomized blocks. Since some residual effects of the old treatments 
may persist, a design of the latin square type suggests itself, with the old 
blocks as rows and the old treatments as columns. Yates (13.3a) de- 
scribes the analysis for the designs illustrated in table 13.3a. 


13.35a Generalized Chain Block Designs. Mandel (13.4a) has shown 
that a few of the chain block designs (section 11.7a) can be written so 
as to adjust for differences between rows and columns. Table 13.4a 
shows a design with this property for 8 treatments. If the columns are 


TABLE 13.4& A GENERALIZED CHAIN BLOCK DESIGN 


Blocks (columns) 
Row|1 2 3 
1 АС ЕТ 
2:1 | Вор РИ 
SULLO. AR. GUB 
4! | DARE HWA 


regarded as blocks, this is а typical chain block design. Treatments 
(C, D) are the link between blocks 1 and 2, (E, F) between blocks 2 and 
3, (G, H) between blocks 3 and 4, and (A, B) between blocks 4 and 1. 
Now examine the arrangement from the point of view of the rows. 
Treatments (C, G) form a link between rows 1 and 3, (B, F) between 
3 and 2, (D, H) between 2 and 4, and finally (А, E) between 4 and bs 
Thus the chain in the rows goes 1, 3, 2, 4, 1. The arrangement is a 
chain block design for both rows and columns, and treatment means 
may be adjusted to remove the effects of the rows and columns in which 
they lie. 

This design was used to compare the wearing qualities of 8 tires. The 
columns represented the 4 positions which a tire can occupy on the car, 
and the rows were four different test runs by the same car. The design 
provides only 2 d.f. for error, but this test was part of a larger one in- 
volving 32 tires. 

Mandel gives designs of this type for 12, 16, 18, 20, 24, and 30 treat- 
ments, with instructions for analysis. 
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13.4а Partially Balanced Designs 


13.41a Description. Many of the partially balanced designs recently 
published by Bose ей al. (13.5a) can be arranged so as to allow the 
elimination of variation in two directions at right angles. Table 13.5a 
shows this arrangement for the design with ¢ = 15, k = 4, that was 
used in section 11.63a to illustrate the analysis of the results of a pbib 
experiment. The rows represent the incomplete blocks and the columns 
are replications. 


TABLE 13.5a А pbib DESIGN IN INCOMPLETE LATIN SQUARE FORM 


t=15,k=4 
Replications 

Block I Il ш IV 
(1) 1 3 4 12 
(2) 2 4 5 13 
(3) 3 5 6 14 
(4) 4 6 7 15 
(5) 5 7 8 1 
(6) 6 8 9 2 
(7) 7 9 10 3 
(8) 8 10 11 4 
(9) 9 11 12 5 
(10) 10 12 13 6 
(11) 11 13 4 7 
(12) 12 14 15 8 
(13) 13 15 1 9 
(14) 14 1 2 10 
(15) 15 2 3 11 


Some of the pbib designs can be set out so that two or more columns 
form a replieation or group of replications, as illustrated by the plan in 
table 13.6a for 8 treatments in incomplete blocks of size б. This design 


TABLE 13.6a А pbib DESIGN IN WHICH A PAIR OF COLUMNS FORMS A 


REPLICATION 
t=8k=6 
Replication 

Block I п ш 
(1) т net" TUS в 
(2) 2r клен ұйы? 
(8) S UTD os tt 5 
(4) 4 1818728076 vy 
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is of the “group divisible” type, with the following association scheme. 


1 5 
И] 
Б] 7 
A 748 


Treatments in the same row of the association scheme (like 4 and 8) 
occur together in three of the blocks; treatments not in the same row 
occur together in two blocks. 

Finally, in some designs a group of rows (incomplete blocks) forms a 
replication, while at the same time a group of columns also forms a 
replication. Table 13.7a for = 12, k = 6, isan example. This design 


TABLE 13.7a A pbib DESIGN IN WHICH PAIRS OF ROWS AND PAIRS OF COLUMNS 
FORM REPLICATES 


t=12,k=6 
Replication 
Reps. Block I II ш 

1 (1) 1 2| 5 6| 9 10 
(2) 3 4| 7 8111 12 

п (3) 9 11 1 3| 5 7 
(4) 10 12| 2 4| 6 8 

ш (5) 5 8|r9^ 12| 1 4 
(6) 6 7|10 in| 2 3 


is also “group divisible,” treatments in the same row of the association 
scheme appearing together in three blocks, and treatments in different 
rows appearing once in a block. The association scheme is 


135 9 
в Ш 
даа ДЕЛ 
4. ul 


Ап index to the available pbib designs of the types illustrated above 
has been given in table 9.2a, p. 392. Randomization and statistical 


analysis are described by Bose (13.5а). 
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TABLE 13.2 INDEX TO PLANS 
(Combines tables 13.1 and 13.2 of first edition) 


k r b by E Plan Туре f 
3 6 6 6 1.00 2LS 

3 9 9 9 1.00 3Ls 

5 5 3 5 .96 13.16 III 
5 10 6 10 .96 13.16 Ila 
T 7 3 7 .98 13.17 IV 
8 8 3 8 .98 13.18 ш 
10 10 3 10 .99 13.19 IV 
3 3 4 2 .89 ы п 
3 6 8 4 .89 м Та 
3 9 12 6 89 ыы Та 
4 8 8 8 1.00 218 

5 5 4 5 96 13.20 IV 
5 10 8 10 96 13.20 Па 
7 7 4 7 98 13.21 ІШ 
9 9 4 9 99 13.22 IV 
2 4 10 1 67 13.6a II 
3 6 10 3 13.7a II 
4 4 5 3 .94 * II 
4 8 10 6 94 “. Та 
5 10 10 10 1.00 218 

6 6 5 6 97 13.23 ТУ 
9 9 5 9 .99 13.24 III 
5 5 6 4 .96 * II 
5 10 12 8 96 ** Та 
7 7 6 7 98 13.25 ТУ 
2 6 21 1 58 13.8a У 
3 3 7 1 78 13.1 I 
3 9 21 3 78 13.1 Ta 
4 4 rà 2 88 13.2 II 
4 8 14 4 88 13.2 Ta 
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TABLE 13.2 Inpex то PLANS (Continued) 


t k т b à Е Plan Туре f 
7 6 6 7 5 .97 * п 
8 8 7 8 .98 18.26 IV 
8 7 7 8 6 .98. * II 
9 2 8 36 1 56 13.98 у 
4 8 18 3 84 13.10a у 
5 10 18 5 .90 13.114 у 
8 8 9 7 .98 ы п 
10 3 9 30 2 ЛА 13.124 у 
9 9 10 8 .99 d II 
п 2 10 55 1 55 13.134 у 
5 5 1 2 88 13.3 I 
6 6 11 3 .92 13.4 I 
10 10 11 9 .99 * II 
13 3 6 26 1 72 13.14a у 
4 4 13 1 .81 13.5 ї 
9 9 13 6 .95 13.6 I 
15 7 7 15 3 .92 13.7 I 
8 8 15 4 .94 13.8 I 
16 6 6 16 2 .89 13.9 І 
10 10 16 6 .96 13.10 I 
19 3 9 57 1 70 13.15а у 
9 9 19 4 .94 13.11 I 
10 10 19 5 :95 13.12 І 
21 5 5 21 1 .84 13.13 if 
25 4 8 50 1 78 13.16a M 
9 9 25 3 .93 13.18 T 
31 6 6 31 1 .86 13.14 1 
10 10 31 3 .93 13.28 І 
37 9 9 37 2 .91 13.15 I 
41 5 10 82 1 .82 13.178 M 
57 8 8 57 1 .89 13.38 І 
73 9 9 73 1 .90 13.48 I 
91 10 10 91 1 .91 13.58 1 


* Constructed from a t X t latin square by omission of the last column. 

** By repetition of the plan for т =t — 1, which is constructed by taking a 
1 X t latin square and omitting the last column. 

t This refers to the method of analysis. For types I, Ia, 
tion 13.2; for types III and IV, see section 13.3. 


II, Па, and У, see sec- 
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PLANS 
Plan 18.1 ет, =3,7r =3,b = Т,\ = 1, Е = 78, Type II 
Reps. 

BlokI ПІ ІП 
ит" “4 
(2) 1 2 4 
(8) 2 3 5 
(4) 3 4 6 
(5) "à %.р» 7 
(6) 5 6 1 
(7) 6 T 2 


Plan 18.2 t=7,k =4,r =4,b =7,d = 2, = 88, Type П 


Reps. 
BlokI П Ш IV 
(1) b. 6 T 
(ох GF 7 TL 
ms T' 1 4 
4 6 1 2.8 
(7 9 8 v4 
(0:1 8 415 
(0 № 576 


Plan 13.3 t=11,k =5,r = 5,b =11,^ = 2, Е = 88, Туре I 
Reps. 


Blok I П II IV V 
09.1 201.8 & 5 
qeu 1,6 10. 8 
(3) 9 1 2 
(А4 ' 9-7 us 
(5 10 410 5458321 
(Gr 9% ОА 
ҮДЕ қы ы Ию 
(@.% 85%И. b" 9 
@ 3 4 0 


o 
© 
o 

Np) oO 


(10) 
an 


> 
ex 
oo 
1-3! oo 


PLANS 
Plan 13.4 1= 1,6 =6,7=3,b = 11,А = 3, Е = 92, Туре I 


Reps. 

Block I IL II IV V УІ 
(1): увер 
(2) 5 8: AH 2 —9 
3 4 $; 71 92-11 10 
(à) 3 10 2 6-=5—% 
(5) 2 РЕ Аа МЫ! 
(6) 1 6 10 С ТШ / 5 
кегі шесі алаш”? 
9-80 
OTa 5 1 а 

(10) 11 4 6 8 1 3 
(11) 10 2-22 2 3 1 


Plan 185  *=18,k=4,r=4,b =13,Х =1,Ё = 81, Type I 


Reps. 
Block I П IH IV 
ОЕ antera de) 
Q 1 2 4 1 
—2-—3, ТӨ 
тя 46 2 
O A eas 
Oe 
(—6— 1— 9— 13 
(—1—-—8—10—8 
09—8—9—11—14 
(0) 9 10 12 5 
ay 10 H 13 6 
а-а-а: 4 


LI 
© 


(13) 12 13 
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Block 
(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8) 
(9) 

(10) 
(11) 
(12) 
(13) 


Plan 18.7 
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t=13,k =9,r =9,b = 13, = 6, E = 95, Туре I 


Reps. 
ТУ Тр УУГ. VV Дх 
2 E 5. Тио9 10. Ш 12 13 
5 6 7 и 13 1 
Бы то 1* 1 3 
Е В 9 40.212 718. Ww а 3 
қ 8 30 311.438 1313. 7 8 4 
Ша № 4 5 
&_1ї Ий 52233 Чч Б в 
бл еее 
ae: пот 70 7 8 
в 9 
13:13 > ПЖ? ТЖ 82558709 10 
8 8 4 Aue 8 9 10 1 
T #55 °° Br 9,"10 13 


L=15,k =Т,т ТЬ = 15, =3, Е = 92, Type I 


Reps. 

Block, ТО ШШ Iv. У УІ ҮП 
() 1591.8. 39— 8. (7 1 9 
(2 5 14 10 7 12 2 8 
w Beas it 8 2 6 
(4) 12 1 6 9 2 4 14 
D в ЕЛ ЖАР 795 16 
(6) 11 9 7 2 13 15 5 
(e Dod 406 T 
(8) 2 3 1 18 15 14 12 
(9 8 6 4 15 10 13 2 

(10) 10 4 5 1 1 12 18 
шш 9.19. 14410 95 3 
(12) 14 T «18 3 4 8 п 
(13) 7 15 9 12 3 10 4 
(14) 3 1 2 8 9 11 10 
(15) 6 10 15 14 1 T 1 


PLANS 
Plan 193.8 t= 15,6 = 8,r = 8, b = 15, à = 4, Е = 94 Type I 


Block I II IH IV V VI VII ҮШ 
(Wee и. Га. ГҮ... ЛГ 
(2). Sa Se aR ГИ АБЦ) 
(9. 9 4 AL OED 
(0.515. ЕТЕР ЕДУТ 


(9) —- 5.234252 Вр 
(10) 8 15 6 ый 


Red 
(19) —92-2:6.215 2:9 7 2 5.2002 D 
(13) 13 52110165506 e 


Ріап 18.9 t = 16, k = 6, r = 6, b = 16, à = 2, Е = .89, Type I 
Reps. 
Block Г II II IV V УІ 
Q4 2 3 0465 


0—5 78-9710 ГЕ 
oe Д9 Е Ба ы 
0 ЕЧ Ий ШАР Е аз... 
(9) 5 12 14 101—106 9 


o $0 153 1 35 
(D 438-815 705-0 
9 Se Hu cs 38 
(9 9 1 n 5 13 2 
(0 10 1 6 12 2 

(пуд АМ 3 9 ВЕ 
(2) 12 3 10 15 8 5 
1931 of 5 3 
14 1 19 5 10 7 4 
0010—0774 87247 и 
ви 
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Plan 18.10 


Plan 13.11 
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1 = 16, = 10,7 = 10, = 16^ = 6, Е = .96, Туре. 


Block 


(1) 
(2) 
(3) 
Nu 
(5) 
(6) 
@) 
(8) 
(9) 
(10) 
(1) 
(12) 
(13) 
(14) 
(15) 
(16) 


Reps. 
I H HI v УІ УПҮШІХ x 
8 


7 9 10 1 12 13 14 15 16 
3—4 65513 316-311 12 6 14 15 
972450008 1035515 216 М 
2307.95 121152510 9-16 412 19 
6787321531457 35x10 213 :11 
4— 55 А2 9779-49 7 2 ДЛ > 12 
1 710 541 45118-25-5,-0-1.13 ee 9 
5 1516: 661696 8 3t 9 7 10 
16 3 12186-66 8 10 id 

7739 $10 $13 109g 4 3 5 9 8 
ТУСЕ к өш йө Т. 1 9-87-18 6 5 
14 n 1316 9 1 he 7 4 6 
1071551292771 5 16 fe 11 5 4 
Hn 1 6 9 8 5 16 1 2 3 
13.10 8 H 10 14 5 3 1 2 
15— 9— 107 12 18 14 4 3 1 


1=19,k =9,т = 9,0 = 19, =4, Е = .94, Type I 


Reps. 


Block Sur ЛУ vr уп уш IX 


NT e es A. ү. 8 9 
у АЛ БОША 21 12 11 10 13 
@ лы 2 10 6 o n 
(4) 12 13 16 [hu as 35 8 
(Б) 9 10 12 16 1 $. 17 19 7 


Plan 18.18 


PLANS 


t= 19, k = 10,r = 10,b = 19, = 5, Е = 95, Type I 


Blok Т H I IV V VI УШ ҮШ Ix 


(а) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8) 
(9) 
(10) 
(11) 
(12) 
(13) 
(14) 
(15) 
(16) 
(17) 
(18) 
(19) 


я 


10 11 12 13 М 15 16 17 18 19 
19 5-2 6З Ту 3129 32155,1060 717 118 


18 19; уе о 18 14 
14 ЗИГЕР ЗІ бо 10 
13 2551537142718 2774€ 15255556 alt 8 
12 Мт ЛОБЕ О 9 6 
15 183; 1172 2420121520 $8.10 8 9 
16 1051119723. 4813 8 5 
11 167 ТИВ 7 
1 10715;,:11,:3192 E ae 
17 1 TE ra a Б 4 


ия 18 
10 18 8 
13 М 18 15 10 
UR argui: 0 65 М 
я 117 711 
10 1 19 
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= 
e 
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527 


528 INCOMPLETE LATIN SQUARES 
Plan 1818 t=21,k =5,r = 5,0 = 21, à = 1, E = 84, Type I 


Reps. 
Bloa&k I II II IV V 
(09219 125, 477 14 2516 
TARET ық” 157 11 
i OX RET 10 18 
OL Bice TS 19 
() 4 5 8 18 20 
(ФЕ R50. 219. 21 
лу a ae PST 
(R05 82.11 "121." 2 
EE тала L*"'.3 
0 9 10 13 2 4 
12107 1L7J4. 8.7.5 
(110 13.15 4" 6 
З) Dee eT a 5". 7 
(à 13417 6 8 
(15) М 15 18 7 9 
(16) 15 16 19 8 10 
шылар. 971 
(18) 17 18 21 10 12 
(197718.7^19 "1^ 11 * 18 
(0) 19 20 2 12 M 
о) 20 21 3 13 15 


PLANS 


Plan 131а і = 25, k = 9, r = 9, b = 25, =3, Е = 93, Type I 
Reps. 

Block I П Ш IV V VI VH ҮШ IX 
(D 15.2.34 £5 SP 5n te 9559 
(2) 2 4 9 10 24 17 15 22 12 
(3) 3 24 8 23 18 21 18 4 10 
(0 4 22 25 8 20. 12. 11 б RU 
(5 52151118 8 п 2 13 20 
(60 6 8 12 13 1 14 24 25 15 
(0) 7.016 5 22 3 10.25. 15 18 
(8 8 10 п 16 6 22 23 17017 
() 9 18 20 5 12 28 1 21 22 

(10 10 19 14 12 16 2 8 5 21 
(1 11 18 19 24 10 1 5 9 25 
(12) 12 6 10 25 7 18 20 2 23 
(13) 13 11 4 9.23 25 14 16 72 
(14) 14 3 75:17 $111 5 12 23 24 
(15 15 20 21 1 14 КЦ 4 
(16) 16 17 18 7 18 19 4 12 1 
(17) 17 14 13 20 19 $3 9 10 6 
(18) 18 9 22 14 25 82:21,417 7 
(19) 19 7 28 15 9 20 16 2 8 
(20) 20 5 16 6 4 24 22 14 18 
(21) 21 12 6211215 9 з 18 16 
(22) 22 21 24 19 2 13 6 ТА 11 
(3) 23 1 2 3 22 15 18 19 14 
(24) 24 25 T 2 21 16 17 20 3 
(25 25 23 15 21 17 4 19 6 5 


529 


530 
Plan 13.14 


INCOMPLETE LATIN SQUARES 


t =31, k = б,т = 6, = 31, à = 1, =.86, Type I 


Reps. 

BE У y "VI 
(1) 31 1 3 8 12 18 
2 1 2 4 9 18 19 
(3) 2 3 5 10 14 20 
(ауан та алага ту ор 
(5) 4 5 ТАЛДА CR 22 
() 5 6 Le es 22 
ПОЗА BTA 
(В 0, 288/510 2 15.119." 25 
(9) 8 027 112.:16.2-20.-* 26 

(10): 5952102 127—*17.--.21 27 
(тор 142751847718” -22 58 
(121211 102 147512072522 29 
(13) 12 18 15 20 24 30 
(14/-18- 14 716 “21” 25° 81 
(15) 514 15 "17" 22."*26 1 
(10) 15.7 416.2 18,1 23.27 `2 
(17) 167 рр * * 192924. *.28 3 
(18/717. 18.9890 2595 29 4 
(19) 18 19 21 26 30 5 
QUI 1009290 :22* 277 "231 6 
(Q1 EP ы ЖАЙ TAR пе 
(22 21 22 24 29 2 8 
(23) 22 23 25 30 3 9 
(24) 23 24 26 31 4 10 
(25) 24 25 27 1 5 11 
(26) 25 26 28 2 6 12 
(27) 26 27 29 3 7 13 
(28) 27 28 30 4 8 14 
(29 28 29 31 5 9 15 
(30) 29 30 1 6 10 16 
(31) 30 31 2 € D it 


Plan 13.2a 


PLANS 


t = 31, Е = 10,7 = 10, b = 31, = 3, = .93, Type I 


Block 
(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8) 
(9) 

(10) 
(1) 
(12) 
(13) 
(14) 
(15) 
(16) 
(17) 
(18) 
(19) 
(20) 
(21) 
(22) 
(23) 
(24) 
(25) 
(26) 
(27) 
(28) 
(29) 
(30) 
(31) 


Reps. 
ТП П.Ў RXL VIL IX ұх 
1 2 4 8 95242155 lof 18 “98 
2 37" Ло оа 10S eee 19. 5” 22 
3 275 20.1027 17 91929 07 м А111 23 
4 ин. dg" 24 
5 55 Е 19” 25 
6 771272162147 50200-21 2 26 
7 1 1514 sar DA 97 
s пи S616 Cor oe T$». 297 5 
9 312^ 24* 297 20.718 9.060 ЛГ 6 
10 13 18 19 29 25 2 27 28 Li 
i 14 232 26 19 20 3 28 29 1 
12 8 27 23 20 20 4 22 21 2 
13 9 239 28 21 15 5 23 24 3 
14 10 25 22 15 16 24 29 6 4 
15 24 207 5 2* 27 31^ 10 30 20 
16 35 6 30 3 28 12 11 27 21 
17 26 28 7 30 22 18 12 4 15 
18 27 28 1 5 30 14 13 22 16 
19 28 30 2 6 M 8 24 282,217 
20: 22 94 726.75 26% 04430 25 18 
оқ -92-10 24 17 267 30" 25 9 19 
229 2] 31578924 1 25 2 31 18 
22 15 3 18 25 2 26 oh 12-7 14 
24 16 19 31 26 3 27 4 13 8 
22:17 14 27 4 28 5 20 9 
20 18 5 21 28 31 22 6 8 10 
27 19 31 15 22 6 23 9 quu 
28 20 16 24 23 Т® ФЕ 1? 102 12 
29 30 2 6 4 5 Же 8 1 31 
30 31 9 13 11 12 10 gu 142) 20 
31 29 21 20 18 19 17 15 16 30 


531 


532 


Plan 18.15 


Block 
(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
@) 
@ 
(9) 

(10) 
(11) 
(12) 
(13) 
(14) 
(15) 
(16) 
(17) 
(18) 
(19) 
(20) 
(21) 
(22) 
(23) 
(24) 
(25) 
(26) 
(27) 
(28) 
(29) 
(30) 
(31) 
(32) 
(33) 
(34) 
(35) 
(36) 
(37) 


INCOMPLETE LATIN SQUARES 


п 


ІУ 


УТ 


t =37, k = 9,т = 9,Ь = 37, à = 2, E 91, Type I 


уп уш IX 


10 


16 


26 


33 


34 


11 


17 


27 


84 


12 


18 


85 


13 


19 


36 


w 
я 


14 


e 
* 


15 


16 


17 


I 
1 
2 
3 
4 
5 
6 
7 
8 
9 


18 


10 


19 
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Рип 13.380 t= 57, k = 8,r =8,b = 57, = 1, E = .89, Type I 
Reps. Reps. 

Block I II IH IV V УІ ҮП VII Block I П Ш IV V УІ VII VIII 
а) 1 2 3 4 5 6 7 50 (300 30 40 43 55 14 17 27 4 
(2) 2 51 16 23 30 37 9 44 (31) 31 41 44 5 8 18 55 28 
(3 3 10 51 24 31 38 45 17 (32) 32 42 55 6 9 19 22 45 
(à 4 1 18 51 32 39 46 25 (33) 33 36 46 7-55 20 233 10 
(5) 5 12 19 26 51 40 33 47 (34) 34 55 47 a 397 34 
(0 6 13 20 27 34 м 48 41 (35) 35 98 48 2 15 55 25 12 
(7) т 14 21 28 35 42 49 51 (36) 36 37 50 39 40 41 42 38 
(8) 8 9 10 112-14-50 13 (37) 37 46 6 8 17 26 35 50 
(9) 9 15 28 34 52 46 40 3 (38) 38 47 7 9 18 56 29 27 

(10) 10 52 22 35 41 47 4 16 (39) 39 48 110 10 28 56 30 
а) 11 17 23 29 42 52 5 48 (40 40 49 2 56 20 22 11 81 
(2) 12 18 52 30 36 49 24 6 (41) 41 56 12 43 21 23 3 32 
(13) 13 19 25 31 37 43 52 T (42) 42 44 4 13 56 24 15 33 
(14) 14 20 26 32 38 44 1 $2 (43) 43 50 45 46 47 48 44 49 
(15 15 16 17 18 19 50 21 20 (44) 44 11 57 27 35 19 36 
(16) 16 28 33 38 53 1l 6 43 (45) 45 4 57 20 28 29 12 97 
(7) 17 22 34 12-4 7 39 53 (46) 46 5 13 21 57 30 38 22 
(18) 18 53 35 40 45 1 13 23 а 7 6 14 15 23 31 57 39 
(19) 19 24 29 41 46 2 53 14 (48) 48 7 8 16 24 57 32 40 
(20) 20 25 30 53 3 8 47 42 (49) 49 1 9 17 25 33 41 5 
(21) 21 36 53 36 48 4 31 9 (50) 50 57 56 52 54 53 м 55 
(22) 22 23 24 25 50 27 28 26 (51) 51 15 22 29 36 43 

(23) 23 34 38 49 4 54 $8 19 (52) 52 21 27 33 39 45 2 8 
(24) 24 35 39 54 43 9 20 5 (53) 53 27 32 37 49 5 10 15 
(25) 25 29 40 44 6 10 54 21 (54) 54 33 37 48 Tne ee Le 
(26) 26 54 41 45 7 15 30 п (55) 55 39 49 3 13 16 26 29 
(7) 27 31 54 42 1 12 16 46 (56) 56 45 5 14 16 25 36 34 
(28) 28 32 36 47 2 13 17 М (57) 57 43 42 10 20 34 18 2 


(29) 


PLANS 


534 INCOMPLETE LATIN SQUARES 
Plan 13.4a t=73,k =9,r =9,b = 73, = 1, E = .90, Type I 
Reps. Reps. 
Block I II Ш IV V УІ УПУШИХ Block I II Ш IV V VI VII ҮШІХ 
() 1 .2 :3 "4 5 6. 7. 865 (98) 38 43 52 б 7 10 17 32 70 
(0) 2 66 18 26 34 42 50 10 58 (39) 39 70 49 64 6 11 20 42 29 
з) 3 11 00 27 35 19 51 43 59 (40 40 4l 5 03 70 12 19 50 30 
а) 4 12 20 66 36 44 52 28 60 (41) 41 42 43 65 44 46 47 45 48 
(5) 5 13 21 29 37 45 53 61 66 (42) 42 54 63 ТІ: 17 28 18 35 
(D 6 14 22 30 38 66 54 62 46 (43) 43 55 62 5 10 20 25 34 т 
(7) 7 15 28 31 39 47 66 63 55 (44) 44 56 61 6 15 20 71 33 19 
(8) 8 16 24 32 40 48 56 66 64 (45) 45 49 60 3 10 71 22 40 зї 
0) 9 10 11 12 13 65 16 14 15 (46) 46 71 50 21 9 4 32 59 39 
(10) 10 19 28 1 46 55 67 64 37 (47) 47 51 58 71 12 2% 29 1 38 
(1) 11 67 25 40 47 54 18 4 61 (48) 48 52 71 37 11 23 30 2 57 
(12) 12 17 67 39 48 53 26 3 62 (49) 49 50 65 52 53 56 55 54 51 
(13) 13 24 31 34 41 52 59 6 67 (50) 50 61 1 48 20 14 35 31 72 
(14) 14 23 32 33 42 51 60 67 5 (51) 51 64 4 15 17 72 34 30 45 
(15) 15 22 29 67 43 50 57 36 8 (52) 52 63 72 16 18 29 2 46 33 
(16) 16 21 30 35 67 49 58 44 7 (53) 6 9 23 28 40 43 
(17) 17 18 19 20 21 22 65 23 24 (54) 54 57 39 10 24 27 72 5 44 
(18) 18 30 68 43 56 60 1 39 13 (55) 55 60 8 11 72 21 38 26 41 
(19) 19 31 38 42 68 57 4 53 16 (56) 56 59 7 72 22 95 37 12 42 
(20) 20 32 37 41 54 58 68 15 3 (57) 57 65 59 60 6l 62 65 58 03 
(21) 21 25 36 08 51 63 6 48 10 (58) 58 8 14 19 25 39 45 73 52 
(22) 22 68 26 47 52 64 535 9  (59)-59 Б 15 18 28 73 48 38 49 
(23) 23 27 34 46 49 61 8 68 12 (60) 60 6 16 17 27 37 73 47 50 
(24) 24 28 33 45 50 08 62 7 1 (61) 61 з 9 24 30 30 42 55 73 
(25) 25 26 27 28 29 30 31 65 32 (62) 62 4 10 23 73 35 41 29 56 
(0) 26 37 09 49 63 43 14 ?4 4 (03) 63 1 73 22 32 34 44 53 
(27) 27 40 42 69 62 1 15 52 21 (4) 64 73 12 2 31 33 43 21 54 
(28) 28 39 4l 51 09 2 01 16 22 (65) 65 72 70 73 6 67 69 71 68 
(29) 29 69 48 54 60 7 9 19 34 (66) 66 9 17 25 33 4l 49 57 1 
(30) 30 33 47 53 359 8 10 20 69 (бт) 67 20 56 38 45 9 63 27 2 
61) 31 36 46 56 38 5 11 69 17 (ву 68 29 40 44 55 59 2 17 14 
(32) 32 35 45 55 57 09 12 18 6 (6) 69 38 44 50 04 3 13 25 23 
(93) 33 34 35 30 65 97 39 37 40 (70) 70 48 55 58 4 12 33 22 27 
(M4) 34 47 57 14 3 70 21 56 28 (71) 71 53 64 7 14 18 27 4l 36 
(85) 35 46 53 70 2 15 24 60 25 (тәу 72 62 2 13 19 32 36 49 47 
(86) 36 45 54 59 1 16 23 70 26 (73) 73 7 13 57 26 40 46 51 20 
8 


(37) 


>, 
3 
© 
© 


PLANS 535 

Plan 185а t= 91, k = 10,7 = 10, = 91, à = 1, Е = .91, Туре I 
Reps, Reps. 

Block I П Ш IV V VI ҮПҮШІХ X Block I II II IV V VI УПҮШЇХ X 
( 1 2 7 H 24 27 35 42 54 56 (47) 47 48 53 57 70 73 81 88 9 1 
2) 2 3 8 12 25 28 36 43 55 57 (48) 48 49 54 58 71 74 82 89 10 12 
(3) 3 4 9 13 26 29 37 44 56 58 (49) 49 50 55 59 72 75 83 90 11 13 
() 4 5 10 14 27 30 38 45 57 59 (50) 50 51 56 60 73 76 84 91 12 14 
() 5 6 11 15 28 31 39 46 58 60 (61) 51 52 57 61 74 77 85 1 13 15 
() 6 7 12 16 29 32 40 47 59 61 (52) 52 53 58 62 75 78 86 2 14 16 
() 7 8 13 17 30 33 41 48 60 62 (53) 53 54 59 63 76 79 87 3 15 17 
() 8 9 14 18 31 34 42 49 61 63 (54) 54 55 00 64 77 80 88 4 16 18 
(9) 9 10 15 19 32 35 43 50 62 64 (55) 55 56 61 65 78 81 89 5 17 19 
(10 10 11 16 20 33 30 44 51 63 65 (56) 56 57 62 66 79 82 90 6 18 20 
(0) m 12 17 20 34 37 48 52 64 66 (57) 57 58 63 67 80 83 91 7 19 2 
(12) 12 13 18 22 35 38 46 53 65 67 (58) 58 59 64 68 81 84 1 8 20 22 
(13) 13 14 19 23 36 39 47 54 66 68 (50) 59 60 65 69 82 85 2 9 2l 23 
(14) 14 15 20 24 37 40 48 55 67 69 (60) 60 61 66 70 83 86 3 10 22 24 
(1) 15 18 21 25 38 41 49 56 68 70 (61) Ol 62 67 71 84 87 4 11 23 25 
(16) 16 17 22 26 39 42 50 57 69 71 (62) 62 63 68 72 85 88 5 12 24 26 
(17) 17 18 23 27 40 43 51 58 70 72 (63) 63 64 69 73 86 89 6 13 25 27 
(18) 18 19 24 28 41 44 52 59 71 78 (64) 64 65 70 74 87 90 7 14 20 28 
(19) 19 20 25 29 42 45 53 00 72 74 (65) 65 66 71 75 88 91 8 15 27 29 
(20) 20 21 26 30 43 46 54 61 73 75 (66) 66 67 72 76 89 1 9 16 28 30 
о) 21 22 27 31 44 47 55 62 74 76 (67) 67 68 73 77 90 2 10 17 29 31 
(02) 22 23 28 32 45 48 56 63 75 77 (68) 68 69 74 78 91 3 1 18 30 32 
(23) 23 24 29 33 46 49 57 64 76 78 (69) 69 70 75 70 1 4 12 19 31 33 
(04) 24 25 30 34 47 50 58 65 77 79 (70 70 71 76 80 2 5 18 20 32 81 
(05) 25 26 31 35 48 51 59 66 78 80 ат 72 7 8 3 6 1 2l 33 35 
(26) 26 27 32 36 49 52 60 67 79 81 (72) 72 73 78 82 4 7 15 22 34 36 
(27) 27 28 33 37 50 53 бі 68 80 82 (73) 73 74 79 88 5 8 16 23 35 37 
(28) 28 29 34 38 51 54 62 69 81 83 (74) 74 75 80 84 6 9 17 24 36 38 
(29) 29 30 35 39 52 55 63 70 82 84 (75) 75 76 81 85 7 10 18 25 37 39 
(30) 30 31 36 40 53 56 04 71 83 85 (7g) 76 7 82 86 8 11 19 26 38 40 
(81) 31 32 37 41 54 57 65 72 84 86 (7) 77 78 83 87 9 12 20 27 39 4l 
(32) 32 33 38 42 55 58 66 73 85 87 (78) 78 79 84 88 10 13 21 28 40 42 
(3) з 34 9 43 56 59 07 74 86 88 (7) 79 90 85 89 11 14 22 29 4l 43 
(34) 34 35 40 44 57 60 68 75 87 89 (80 80 81 86 90 12 15 23 30 42 44 
(35) 35 36 41 45 58 61 69 76 88 90 уз 82 87 91 13 16 24 31 43 45 
(36) 36 37 42 46 59 62 70 77 89 91 (82) 82 83 88 1 M 17 25 32 44 46 
(37) 37 38 43 47 60 63 71 78 90 1 (83) 83 84 89 2 15 18 26 33 45 47 
(083) 38 39 41 48 61 04 72 2 (yy gy 95 00 3 16 19 27 34 40 48 
(39) 39 40 45 49 62 65 73 80 i. (85) 85 86 91 4 17 20 28 35 47 49 
(40) 40 41 46 50 63 66 74 81 24 (86) 86 87 1 5 18 21 29 36 48 50 
UD a 42 47 51 64 67 75 8 8 5 ут s 2 6 19 22 90 37 49 5i 
(B) 33 43 48 52 65 08 76 83 4 6 (ы 9 3 7 20 23 31 38 50 5 
а) 3) 4i 49 53 68 09 77 84 5 7 шош 4 8 21 24 32 39 5 53 
ам 45 80 54 67 70 78 85 6 8 шы 5 9 22 95 33 40 52 5 
(45) 45 46 51 55 68 71 79 86 7 9 (91) ГИЛЕ. ЕТИШЕ. ЖОШ. x 
(46) 46 47 52 56 69 72 80 87 8 10 


536 INCOMPLETE LATIN SQUARES 


Plan 18.16 1= 3,6 = 5,т =5,b =3,d = 5, Е = .96, Type ПІ 
Reps. 
Bock I ПІ ІП IV V 
Пу 1 2 3 2 3 
(2) 2 3 1 1 2 
(3) 3 1 2 3 1 
Plan 13.17 1=3,k =7,r = Т, = 3, ^ = 7, E = .98, Type ТУ 
Reps. 
BlokI II Ш IV V VI ҮП 
ar* 2 3 1 2 3 1 
() 2.3 1 3 1 2 2 
(3) 3 1 2 2 3 1 3 
Plan 13.18 1= 3,6 = 8,7 = 8,0 = 3, à = 8, Е = .98, Type Ш 
Reps 
BlokI II Ш IV V УІ ҮП ҮШ 
() 1 2:—15 1 2. 3 1 2 
22 3 1 3 1 2 2 3 
(8) 3 1 24:2 38 1 3 1 
Plan 1810 t =3,k = 10, r = 10, b = 3, à = 10, Е = .99, Type IV 
Reps. 
BlekI II IH IV V VI VII УШ IX X 
@) 1 2. 8 1 2 3 1 228 1 
22 3 1 3 1 28 12) 28 d 3 
(3) 3 1 A 3 H 3 1 2 2 
Plan 18.20 t=4,k =5,r=5,b =4,\ = 5, Е = .96, Туре ТУ 
Reps. 
BlekI П Ш IV V 
HEIL CT 3 4. t 
223 8 4 1 4 
(3) 3 4 1 2 3 
(4) 4 1 i 


PLANS 


Plan 13.21 t=4,k = т =7,b = 4,\ = 7, Е = .98, Type ИТ 
Reps. 
BlekI П Ш IV V VI VII 
шт 2 Б] 4 1 2 Б] 
(2) 2 1 4 Б] 3 4 1 
(3) 3 4 1 24 9 2 
6064 D2 SS TM 
Plan 13.22 1= 4,6 =9,r=9,b =4,\ = 9, Е = .99, Туре IV 
Reps. 
Block I < ID Hi IV Ж УЕ ЗУУШ: IX 
(1 1 2 3 4 1 2 3 4 1 
(2-2 ігуі (У42Ұ2 ves Ін 
ЕЕ Е s $210 2 
(4) 4 3 2 H 2 1 4 3 3 
Plan 18.28 t=5,k =6,r=6,b =5,\ =6, Е = 97, Type IV 
Reps. 
BlckI П Ш IV V VI 
in P x t EE 
(2) 2 4 5 3 1 3 
(3) 3 1 2 5 4 4 
(4) 4 5 1 2 3 2 
(5) 5 3 4 1 2 5 
Plan 13.24 1 =5,k =9,r =9,b =5, = 9, Е = 99, Type Ш 
Reps. 
BlokI I Ш IV V VI ҮП ҮШ IX 
() 12-9139 Бе 
@ 2 tare 2 iL dU M ya 
(y = ЕЕЕ a 
а 3 
(қар En m Eur eH ОИ 


537 


538 INCOMPLETE LATIN SQUARES 
Plan 18.25 1t=6,k =7,r =7,b =6,à = 7, Е = 98, Type IV 


Reps. 
Б ОКТ IE M Wi. V. VIVI 
() 1 2 3 4 5 6 1 
Q 3 a € 5.4 b (8 
(3) 3 6 2 5 1 4 5 
d 4 xk 5 2 & зм 
(5 5 1 4 6 3 2 6 
(0016 — 14,0 -5 1.84 /2--2- 2 


Plan 18.26 t=7,k = 8,т = 8,0 =7,r = 8, Е = .98, Туре IV 
Reps. 


BlokI П Ш IV V УІ VII ҮШ 
(011 5258 ®4 ЕБ 16 7 1 
Gta 5 1.7 16%1 vg 
Os 6 7 ЖӨЖ 1 53 7 
(T4. »Т® gs 56- 807 тд Aqu 
о Xam Ж X B5 
(0) 8; 8) Я. Шу Xr Ux. A 
OSs ERU. 2 


EXTENDED INCOMPLETE LATIN SQUARES 
Plan 18.64 t=5,k = 2, r = 4, b = 10, à = 1, E = 67, Type II 


Reps. Reps. 
BlokI IL ш IV 
(b) а () 1 3 
(2) 2 5 (7) “2-4 
33 4 (39. s" 2 
(Core 31 0) 4 5 
() 5 3 00 5 1 


Plani$7a  t—5,k-3,r-6, 6-104 = 3, Е = 83, Type II 


Reps. Reps. 
Block I II III Ж wv м 
(D 1 “92-8 (rs 2 4 
(5) МЛ () 2 3 5 
0 з 4 2 (8) 3 * 1 
(4) 4 5 1 (9) 4 5 2 
(5) 5 Б] 4 (10) 5 1 3 


PLANS 539 
Plan 188а #=7, Е =2,r=6,b = 21, X = 1, Е = 58, Type У 


Reps. Reps. Reps. 
Block I П HI IV у VI 
a@ І 2 (@ 1 3 5 1 4 
@ 26 @ 2-4 (16) 2 3 
(33 4 (0) 3 5 (1) 3 6 
(4 7 ay 4 6 (8) 4 5 
5 1 (19) 5 7 (19) 5 2 
(0 6 5 (13) 6 T (206 7 
() Ф 13 а4 7 2 (27 1 


| 


Plan 13.9a 1t=9,k = 2,r = 8, b = 36, à = 1, Е = .56, Type V 


Reps. Reps. Reps. Reps. 
Block I П ІШ IV V м ҮП УШ 
dj iag (10) 1 3 9 1 4 (8) 1 5 
22 8 (1) 2 5 (20) 2 6 (29) 2 4 
(3) 8 4 (12) 8 6 (из 2 (30 3 8 
(4) 4 7 (13) 4 9 (22) 4 5 (3) 4 6 
(5) 5 6 (4) 5. 8 (UMS 7 (82) 5 3 
(6 1 (15) 6 7 (24) 6 8 (33) в 9 
б T 3 (16) 71 T (25) 7 9 (80-10 
8 9 (17) 8 4 (26) 8 1 (35 8 Ш 
99 5 (8) 9 2 (27) 9 3 (36) 9 1 
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CHARTER ‚ВА 


Analysis of the Results 


of a Series of Experiments 


141 Initial Steps in the Analysis 


14.11 Introduction. In a program of research it is quite common to 
repeat the same experiment at a number of different places, on a number 
of different occasions. There may be several reasons for this. Some- 
times the object of the research is to produce recommendations which 
are to apply to a population that is extensive either in space or in time 
or in both. Thus in agricultural field experimentation, many projects 
are undertaken in the hope that their results can be applied in practical 
farming. The conclusions drawn from such research, if they are to be of 
use, must be valid for at least several seasons in the future and over a 
reasonably large area of farm land. It has been found that the effective- 
ness of the common plant nutrients, of different varieties of a crop, and 
of different cultivation practices usually varies from field to field and, 
even more markedly, from season to season. А single experiment, how- 
ever well conducted, supplies information about only one place and one 
season. Consequently such experiments are carried out at several dif- 
ferent places in the area for which recommendations are wanted, and are 
repeated for a number of seasons. 

In other cases we may be interested, not in making inferences about 
some specific population, but in studying the influence of external con- 
ditions on some measurement or on the responses to treatments. For 
example, is the vitamin A content of a vegetable affected by the climate 
in which it is grown? Do the relative durabilities of two types of road 
surface depend on weather or topography or traffic load? Repetition of 
such experiments in different places is necessary in order to have vari- 
ations in the external factors that are under investigation. 

A third example is supplied by collaborative experiments on biological 
assay by a group of laboratories. Here the objects may be to obtain in- 
formation about the accuracy with which the potency of a drug can be 
estimated and to discover whether different laboratories reach the same 
conclusions about the relative potencies of different drugs. Bliss (14.1) 
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gives an interesting account of a series of experiments of this type, where 
sixteen laboratories estimated the potency of preparations of digitalis by 
injection into cats. 

The appropriate statistical analysis for the data from a series of ex- 
periments will of course vary with the object of the research. Never- 
theless, the preliminary stages of the analysis tend to be the same in all 
cases. In this chapter an introductory account is given of these initial 
procedures, which present difficulties that are not always appreciated. 
The first step, which is to analyze and interpret the data from each in- 
dividual experiment, will he assumed to have been completed. 

When we begin to combine the data, the first point of interest is to 
examine whether the differences among treatments are the same in all 
experiments. This question is likely to be relevant whatever the purpose 
of the experiments. In experiments designed to lead to the recommenda- 
tion of a “best” treatment for some operation, we wish to know whether 
there is a consistent superiority of certain treatments, or whether on the 
other hand we may have to consider recommending different treatments 
for different circumstances. In the other two types of experimentation 
mentioned above, the test would indicate whether the responses to treat- 
ments have varied with the external conditions of the experiment, or 
whether the different laboratories agreed in their estimates of relative 
potency. 

Secondly, it is often, though not always, desirable to estimate and 
compare the average effects of treatments over the whole series of ex- 
periments, 


1412 Numerical Example. The data come from a group of 6 experi- 
ments on Irish potatoes conducted in two counties of North Carolina in 
1945 and 1946. The results have been described by Nelson and Hawkins 
(14.2), and form part of a study of the responses to applications of super- 
phosphate on soils of varying degree of fertility. In 1945, on each of the 
6 sites, the amount of readily soluble phosphorus in the soil itself was 
estimated from soil samples by the modified Truog method. These 
amounts varied from 48 to 850 Ib. P.O; per acre. The treatments com- 
prised 5 different levels of application, 0, 40, 80, 120, and 160 Ib. P2Os 
per acre. 

These experiments are an example of the second type discussed above, 
in that their object was to find out the extent to which the responses to 
treatments were influenced by the condition of the soil. The soils were 
not intended to be a representative sample of the soils in the counties. 
but were chosen so as to give a wide range in fertility. 
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Potatoes were grown on the plots in 1945 and 1946, the treatments 
being applied in both years, with in addition substantial applications of 
nitrogen and potash on all plots so that these nutrients would not be 
deficient. The data to be analyzed are the 1946 results. The mean yields 
and error m.s. per plot are shown in table 14.1. Іп five of the six experi- 


TABLE 14.1 TREATMENT YIELDS (100 тв. PER ACRE) or IRISH POTATOES, 1946 


Experiment number 
Pounds Рҙ05 мураб аа |0: Вла о4 5 6 
applied Amount of readily soluble phosphorus 
per acre (pounds P40;) іп soil, 1945 

48 310 410 710 790 850 
0 114 142 180 170 130 170 
40 206, 176 188 171 140 178 
80 231 201 207 188 150 187 
120 237 205 208 185 152 188 
160 252 217 217 189 156 189 
Г 106 83 158 43 204 52 
d.f. 21 12 12 12 12 12 
rt 4 5 5 5 E m. 5 


* Error m.s. per plot. 
1 Number of replications, 


ments the Ғ-табов for treatments against error were significant. One 
feature of the results in table 14.1 is that experiment 1, which has the 
lowest amount of available phosphorus in the soil according to the soil 
tests, actually gave the highest total yield in 1946. It should be re- 
membered that the soil tests were made in 1945, and that some residual 
effects of the 1945 applications may have persisted. Whatever the rea- 
son, this soil was apparently very responsive to phosphorus in 1946; the 
plots without phosphorus do have the lowest yield of all places. 

It is possible to compute from table 14.1 a combined analysis of vari- 
ance for all six experiments. As will be seen later, there are limitations 
to the use of such an analysis. For the present we will ignore any сот- 
plexities that may arise. Further, in order to discuss only the simplest 
case at first, we will omit experiment 1, which differs in size and structure 
from the other experiments. This omission is of course inappropriate 
from the agronomic point of view, but the data are being used to throw 
light on the general procedure in analysis rather than on the agronomic 
questions involved. 
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14.13 Preliminary Combined Analysis When all Experiments Have the 
Same Design. Formally, the analysis subdivides into the following 
components. 

Places 

Treatments 

Treatments X places 

Pooled experimental error 


In addition, in a complete analysis there are terms representing the dif- 
ferences among rows and columns in the individual experiments. Since 
these are not relevant to the interpretation, they are not included. 

The analysis can be computed from either treatment means or totals; 
the totals will be used here and are shown in table 14.2. 


TABLE 14.2 TREATMENT TOTALS FOR EXPERIMENTS 2 то 6 (iN 100 тв.) 


Pounds Experiment 
P305 2 3 4 5 6 
0 710 900 850 650 850 | 3,960 
40 880 940 | 855 700 890 | 4,265 
80 1,005 | 1,035 940 750 935 | 4,665 
120 1,025 | 1,040 925 760 940 4,690 
160 1,085 | 1,085 945 780 945 | 4,840 


Total 


Total | 4,705 | 5,000 | 4,515 | 3,640 | 4,560 | 22,420 


The first 3 components are computed in the same way as for a single 
randomized blocks experiment, except that all sums of squares are di- 
vided by an extra 5 in order to reduce them to a single-plot basis. Thus 
we have 
(4705)? +- - - + (4560)? if (22,420)? 


P! : = 
laces: 55 = 41,367 
3960)? +... 2 2 
T зн (3960) x + (4840) ee 20,979 


To obtain the treatments X places ss. we calculate the total s.s. for 
table 14.2, which comes to 69,199. Then 


"Treatments X places: 69,199 — 41,367 — 20,979 = 6853 


Since all experiments have the same number of error degrees of free- 
dom, the pooled error m.s. may be found as the simple average of the 
five s values in table 14.1. This comes to 108, as given in table 14.3. 

The null hypothesis that the treatment differences are the same at all 
Places (ie., that there are no treatment Х place interactions) is tested 
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by the F-ratio 428/108, or 3.96, with 16 and 60 d.f., respectively. Since 
the 5% level is 1.81, the ratio is definitely significant. 


TABLE 14.3 COMBINED ANALYSIS OF VARIANCE (ON A SINGLE-PLOT BASIS) 


Source of variation d.f. 8.8. m.s. 
Places 4 41,367 
"Treatments 4 20,979 5,245 
Treatments X places 16 6,853 428 
Pooled error 60 6,480 108 


А test of the average responses to treatments taken over all five places 
is of minor interest in this example, because the places do not constitute 
а random sample from a population about which we wish to make in- 
ferences. For purposes of illustration, however, we will assume that the 
places were selected as a random sample of fields on which potatoes might 
be grown commercially. 

In the F-test of the average effects of treatments there are two possible 
candidates for the denominator of F—the mean square for the treat- 
ments X places interactions (428) or the pooled error m.s. (108). Some- 
times this competition does not arise, because our knowledge of the data 
and the F-test of the interactions both indicate that there is no reason to 
suppose real interactions to be present. In that event the mean squares 
for interactions and error may be pooled to form a single denominator 
for the F-test of treatments. But frequently the experimental conditions 
are such that we expect interactions to be present. This is so in the ex- 
ample, where previous work in a number of countries has shown a rela- 
tion between the response to phosphorus and the amount found in the 
soil by a soil test. Consequently, even if the interactions m.s. had not 
proved significant, we might have been unwilling to assume that there 
were no interactions. 

In discussing the two F-ratios it is helpful to examine the mathematical 
model on which the combined analysis is based. If ту is the observed 
mean for the jth treatment at the ith place, we postulate that 


шу = p + Ti + rj Ри; T; (14.1) 


where ту, 7; represent the effects of the place and the treatment, respec- 
tively, шу that of the treatment X place interaction, and ё;; that of the 
experimental error. ё;; is the average of the errors on the r plots that re- 
ceive the treatment at that place. 

With this model we can study the nature of the mean squares in the 
analysis of variance. If the experimental errors on individual plots have 
а variance o,”, and if the interaction terms ду; may be considered to have 
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a variance c,?, the average values of the mean squares work out аз fol- 


lows. 
TABLE 14.4 EXPECTED VALUES OF MEAN SQUARES 


Source of variation Expected value of mean square 
Treatments oe + roy — 7)? 
Treatments Х places oe + тоц? 

Pooled experimental error oe 


The symbols r, p, and ¢ stand respectively for the numbers of replica- 
tions, places, and treatments. 

It will be noted that the treatments m.s. is influenced by three com- 
ponents—the experimental error variance, the variance of treatment X 
place interactions, and the variance among the true treatment means 
7j. The pooled experimental error takes account of only the first of these 
components. Hence the treatments m.s. may be statistically significant, 
as compared with the pooled error m.s., either because there are real dif- 
ferences among the 7’s or because treatment X place interactions are 
present. This F-test is informative only when we are indifferent as to 
whether a significant treatments m.s. was due to real treatment dif- 
ferences or to interactions. This situation is rare in practice. In most 
cases, on the contrary, having established that interactions are present, 
or at least that it is not safe to assume them absent, we wish to know 
whether in addition there are consistent differences among the effects of 
treatments. As table 14.4 shows, the appropriate denominator of Ё for 
this test is the interactions m.s. This contains both the interaction and 
error components of variation in exactly the same way as they enter into 
the treatments m.s. The F-ratio, 5245/428, or 12.26, with 4 and 16 d.f., 
is significant at the 1% level. 

The conclusions from this initial analysis are: (i) there are real dif- 
ferences in response that are consistent from place to place, and (ii) there 
are real variations in responsiveness from place to place. It need not be 
stressed that these statements do not constitute a competent summary of 
the results. They merely indicate what should be examined next. In 
the present case we would consider which treatments have proved con- 
sistently superior, and, if possible, why this is so. Similarly, we should 
investigate the nature of the interactions and try to find a rational ex- 
planation for them. 


14.2 Criticisms of the Preliminary Analysis 


14.21 Heterogeneity of the Interaction Variance. The previous anal- 
ysis is open to several criticisms. Although these criticisms are not valid. 
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for all series of experiments, our experience is that it is well to be on one’s 
guard against them. They deal essentially with the assumptions on 
which the combined analysis was based. 

These assumptions have been presented in section 14.13. Specifically, 
we postulate that the model (14.1) holds. Further, the experimental 
errors е; of individual observations are assumed to be normally and in- 
dependently distributed with the same variance c,?. Finally, for the 
F-test of the treatments m.s. against the treatments X places inter- 
actions, we require also the assumptions that the interaction terms pij 
are normally and independently distributed, with zero population means 
and the same variance ¢,”, and are independent of the e's. 

The first eriticism is that some components of the treatments X places 
в.в. may be much larger than others, or in mathematical terms that the 
“interaction” variance оу? is not constant. This will happen if the ef- 
fectiveness of some treatments varies greatly from place to place, while 
that of others varies less or not at all. 

If the interaction m.s. is heterogeneous in this sense, the F-test of 
treatments against interactions is vitiated. The general effect is that the 
F value read from the tables is too low, i.e., that too many significant re- 
sults are obtained. Some idea of the extent of the bias can be obtained in 
certain extreme cases. In the numerical example, if one comparison 
among the treatment means has a much larger interaction variance than 
any other comparisons, the F-ratio is distributed approximately as an 
F value with 1 and 4 d.f., respectively, instead of 4 and 16 d.f. The 5% 
significance level would be 7.71 instead of 3.01. More generally, if there 
are p places, F is distributed approximately with 1 and (p — 1) d.f. 
This situation produces about the greatest distortion in P that is likely 
to arise, so that 7.71 could be regarded as an upper limit to the signifi- 
cance level of F. | 

The exact distribution of F сап be worked out, but is not available in 
a form adapted for practical use. Even so, in many instances, uncer- 
tainty about the correct significance level for F does not preclude us f rom 
drawing conclusions from the test. For, in the example above, if F 
turned out to be 1.38 we would be confident that it is not significant, since 
the 5% level is at least 3.01. Similarly, there seems little doubt that the 
F value actually obtained in the experiments, 12.26, 2s significant, since 
it lies well above 7.71. On the other hand, we would be uncertain about 
significance if Г happened to lie between 3 and 7. M 

А better method of coping with this difficulty is to divide the treat- 
ment s.s. into a set of orthogonal components that will supply all or most 
of the information of interest to us. The interactions s.s. 18 partitioned 
in the same way so as to isolate the interactions of each component with 
places. By Bartlett’s test for homogeneity of variance, reference (14.3), 
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we can then test whether c,? is the same for all the components of the 
interactions s.s. If we decide that ¢,’ cam be assumed constant, the 
difficulty vanishes. If c,? is not constant, it is valid to test any com- 
ponent of the treatments s.s. against its own interaction with places. 
Apart from the extra computations, the only drawback to this pro- 
cedure is that the degrees of freedom in the denominator of F are reduced. 

These remarks will be illustrated by the example. In these experi- 
ments the principal point of interest is to see whether the average re- 
sponse to a given amount of phosphorus decreases when the amount 
presumed to be in the soil increases. It may also be worth while to ex- 
amine whether the rate of decline in response with the higher levels of 
dressing changes with the nature of the soil. These questions can be 
considered conveniently by fitting at each place a parabolic regression 
of the yield on the amount of dressing. Although a parabolic regression 
would scarcely be regarded as the true form of the response curve, it ap- 
pears to fit the data well. 

Since the dressings increase by equal amounts, the orthogonal poly- 
nomials given by Fisher and Yates (14.4) are suitable. The calculations 
are shown below for experiment 3. The linear term is proportional to 


Multipliers for 


эк? тшш 

Amount Total Linear Quadratic 
of P305 yield term term 
0 900 -2 2 
40 940 -1 --1 
80 1035 0 -2 
120 1040 1 -l 
160 1085 2 2 
Sum of products 470 —80 
Divisor for square 50 70 


the average increase in yield per 40 Ib. of P3505: the quadratic term may 
be interpreted as measuring the rate of decline in response with increased 
dressings. The values found for the two terms are as follows. 


Experiment 
2 3 4 5 6 Total 


L 895 470 260 320 240 2185 
9 | —325 —80 -70 - 100 -110 —685 
The contributions to the treatments s.s. are 


(2185)? 685)? 
= 19,097; tado: f ) 


Linear: 
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In the treatments X places s.s. we have 


895)? +- - -+ (240)? 
кср машы D? 10097 = 5893 


325)? +- - -+ (110)? 
дхлікені Қасы 9) 


In table 14.5 the three interaction mean squares are in the same order of 
size as the corresponding treatment mean squares. This result is 
typical, in that large effects tend to have large interactions. It is obvious 
on inspection that the components of the interaction m.s. cannot be 
regarded as homogeneous. Accordingly, we test the linear component 
of treatments against the mean square 1478 instead of against the com- 
plete treatments X places m.s. of 428 in table 14.3. This makes a sub- 
stantial difference to the F-ratio, though it remains significant. The 
quadratic X places m.s. (161) is not significantly above the error m.s. 
(108), but it seems prudent to use it as the denominator in the F-test of 
the quadratic term in treatments. The remainder of the interactions 
shows no sign of significance, and could be pooled with error for an F- 
test of thé deviations from regression. 


— 1341 = 645 


TABLE 14.5 SUBDIVISION OF THE ANALYSIS OF VARIANCE 


Source of variation d.f. 5.8, m.s, 
Treatments 
Linear 1 19,097 19,097 
Quadratic 1 1,341 1,841 
Deviations from regression 2 541 270 
Treatments X places 
Linear X places 4 5,893 1,473 
Quadratic X places 4 645 161 
Deviations X places 8 315 39 
Pooled error 60 6,480 108 


14.22 Heterogeneity of the Experimental Error Variances. А second 
criticism concerns the assumption that the experimental error variances 
с are the same in all experiments. In general this assumption will hold 
only if all experiments have been conducted in the same way, with the 
same amount of control over environmental conditions and with experi- 
mental material of the same variability. In experiments with crops or 
animals this degree of uniformity is seldom attainable, because the 
natural variability among pieces of land or among animals at one place 
differs from that at other places. Hence in this and many other types of 
cooperative experimentation we expect а priori that experimental error 
variances change from place to place. 
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When there is doubt about this point, Bartlett’s test of homogeneity 
of variances can be applied to the error m.s., s? in the experiments, In 
the example (table 14.1) the s? values varied from 43 in experiment 4 to 
204 in experiment 5, and the test of homogeneity shows that c,? cannot 
be assumed constant. 

Variation in о.? invalidates the F-test of the interactions m.s. against, 
the pooled error m.s. Although the effect on the significance level of F 
is not, known exactly, it operates so that use of the tabular F produces 
too many significant results, As in the previous section, the extent of 
the distortion in Ё can be seen in extreme cases. If one experiment has 
a much higher error variance than any of the others, F will be distributed 
approximately as the tabular F with (£ — 1) and n’ degrees of freedom, 
where ¢ is the number of treatments and 7’ is the number of error degrees 
of freedom in the experiment, with the high error variance. In the ex- 
ample, if one of the experiments with 12 error d.f. happened to be much 
less accurate than the other experiments, would have 4 and 12 d.f. in- 
stead of 16 and 60 as used іп the test in section 14.13. Тһе 5% level 
would be 3.26 instead of 1.81. Since this case appears to be the most un- 
favorable that would occur, we may conclude that the true significance 
level of F lies somewhere between these values, 

If the observed F in our data falls outside these limits, a definite соп- 
clusion can be reached from the test without further knowledge of the 
exact significance level of P. This happens in the example, where the 
observed F-ratio for the interactions m.s. is 3.96. However, even if the 
exact significance level were known, the test can be criticized because 
the F-ratio is no longer the most sensitive test criterion. Studies have 
shown that, with the amount of variation in сг? that appears typical of 
agricultural experimentation, this loss of sensitivity might be equivalent 
to discarding 10 to 20% of the data. An approximate test that avoids 
some of the loss in sensitivity will be presented in section 14.4, 

Thus far we have been discussing how the F-test of the interactions is 
affected by heterogeneity in the error variances. The F-test of treat- 
ments may now be considered. If the interactions are regarded as neg- 
ligible, so that the denominator of F is the pooled m.s. for interactions 
and error, the F-test of treatments is invalidated in about the same way 
as that of interactions. On the other hand, when interactions are 
present, and especially when they are large, the F-test of the treatments 
m.s. against the interactions m.s, is much less disturbed. The reason 
may be clearer if we consider the test of the linear component of treat- 
ments in table 14.5. The suggested denominator for this test was the 
linear X places component, 1473. This is nearly 14 times the pooled 
error m.s. It appears, therefore, that heterogeneity in the error variances 
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can influence only a small part of the interaction m.s., so that its effect 
is, as it were, greatly reduced. These remarks apply with much less foree 
to the quadratic X places m.s., to which, taking the data at their face 
value, the pooled error contributes more than half. 


14.23 Summary. When all experiments are identical in structure, a 
combined analysis of variance can be computed relatively easily, Ар- 
plication of the ordinary tests of significance to this analysis is frequently 
open to question because of heterogeneity in the error and interaction 
variances. Despite this, it is advisable to draw preliminary conclusions 
as far as possible from this analysis, at least in the present state of our 
knowledge, because tests that are fully efficient and theoretically sound 
have not yet been discovered, and the approximate tests that have been 
devised to meet the criticisms are more laborious. 

When the interactions are sizable, there is usually little difficulty in 
interpreting the combined analysis. For, if the F-ratio for interactions 
is large, we may be confident that it is statistically significant in spite of 
some uncertainty about the exact significance level of F, and the crit- 
icism that the F-test is not fully sensitive carries less weight if F estab- 
lishes significance. Further, as we have seen, the F-test of the treat- 
ments m.s, against the interactions m.s. is little affected by inequality 
in the error variances when the interactions are large. The chief point 
to remember is that the interactions m.s. may itself be heterogeneous, 
Subdivision of the treatments and interactions m.s. according to the 
treatment comparisons that are of greatest importance is often 
useful, 

Another situation that presents little difficulty occurs when the aver- 
age differences among treatments are substantial, yet interactions are 
negligible, In this case it may be found that neither the Ё value for in- 
teractions nor that for treatments is close to the significance level. The 
cases that leave us in doubt are those where the / values are just slightly 
above the tabular significance levels. 


14.3 Experiments of Unequal Size 


14.31 Numerical Example. Provided that the same set of treatments 
appears in all experiments, a combined analysis can usually be con- 
structed even when the experiments differ in size and structure. Some 
issues arise, however, that are not encountered when all experiments are 
identical in design. The problems will be illustrated by the inclusion of 
the first experiment in the previous example, ‘This was arranged in гап- 
domized blocks with 4 replications, whereas all other experiments were in 
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5 X 5 latin squares*. The treatment totals and numbers of replications 
are shown in table 14.6. 


TABLE 14.6 TREATMENT TOTALS (in 100 Ib.) 


Pounds Experiment Tota | №: of 
P205 1 2 3 4 5 6 plots 

0 456 710 900 850 650 850 | 4,416 29 

40 824 880 940 855 700 890 | 5,089 29 

80 924 | 1,005 | 1,035 940 750 935 | 5,589 29 

120 948 | 1,025 | 1,040 | 925| 760 | 940 | 5,638 | 29 

160 1,008 | 1,085 | 1,085 945 780 945 5,848 29 

Total 4,160 | 4,705 | 5,000 | 4,515 | 3,640 | 4,560 | 26,580 145 
No. of plots 20 25 25 25 25 25 


A combined analysis can be obtained by following the standard pro- 
cedure for data based on unequal numbers. The square of any quantity 
is divided by the number of replications involved. The sums of squares 
are given below. 


56)? Ot 2 wm 2 2 
oit (456)? +- - - + (1008) ip (19) +++ (945) _ (26,580) = 131,925 


т 
4 5 145 
2 Бу? dou 2 2 
Places; (160) т (4705)? +... + (4560)? (26,580) = 55,509 
20 25 145 
емад 2 2 
ааа а sets OEE 020,580) = 45,613 
29 145 
Treatments X places: 131,925 — 55,509 — 45,613 = 30,803 


The pooled error m.s. requires a little thought. If all experiments 
have the same true error variance ¢,”, the best procedure is to weight 
each error m.s., s?, by the number of degrees of freedom n;. But if the 
experiments vary in accuracy, this weighted mean is a biased estimate 
of the component of error variance that enters into the interaction and 
treatments m.s. The correct component is 2 утуе;?/ рту instead of 
У то?/}у ni- Consequently, unless we are confident that о.? is con- 
stant, it is best to weight the s? values by the numbers of replications. 
This gives 

2 _ 4106) + 5(83 + 158 +--+ 52) 


Se 108 
29 


* Since experiment 1 contained additional treatments (not discussed here), it pro- 
vided 21 error d.f. 
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The F-test of the interactions m.s. against the error m.s. is carried out 
in the same way as with experiments of equal size, and is subject to the 
criticisms previously discussed. The F-test of the treatments m.s. 
against the interactions m.s. encounters a new difficulty that is due to 


TABLE 147 PRELIMINARY ANALYSIS OF VARIANCE FOR EXPERIMENTS 
OF UNEQUAL SIZE 


Source of variation df. 5.3. m.s. 
Places 5 55,509 11,102 
Treatments 4 45,613 11,403 
Treatments X places 20 30,803 1,540 
Pooled error m.s. 81 108 


the unequal numbers of replications and is present even if all the as- 
sumptions required for the analysis of variance are satisfied. Under 
these assumptions, the expectations of the principal mean squares in 
table 14.7 are as given in table 14.8. 


TABLE 14.8 EXPECTED VALUES OF MEAN SQUARES (WITH UNEQUAL NUMBERS OF 


REPLICATIONS) 
Source of variation Expected value of mean square 
(у — 7% 
Treatments ed + Мор + (Dr) ке” 
Treatments X places сё + fao 
Pooled error аё 
where 
Yr? 141 
ж = Se = — = 4.862 
A 225.29 
1 1 
-——— ; — Я) = (29 — 4.862) = 4.828 
7a ean" 1) 5. 


The complicating factor is that the coefficient of o? in the expected 
treatments m.s. is not the same as that in the expected treatments X 
places m.s.; in fact, it is always larger. The difference is very small in 
this example, as it is whenever the experiments do not vary much in 
numbers of replications. In addition, and for the same reason, the in- 
teractions m.s. is not distributed as a multiple of chi-square; that is, it 
does not have the type of distribution required for the validity of the 
F-test. Both factors tend to make the F-test give too many significant 
results. 

А crude method which at least prevents us from being led too far 
astray is to adjust the F-ratio so as to remove the upward bias in ex- 
pectation. Let the expected values of the three mean squares be de- 
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noted by бь, бр, and б„, respectively. Then, if the null hypothesis is true 
(all 7; equal), it is seen from table 14.8 that the three expected values are 
connected by the equation 


6; + (Е — 16. = К, 


where k = 7,/r;. In other words, the test that we seek is a test that this 
relation holds. This suggests that F might be computed as 


s? + (k — 1)8,2 _ 11,403 + 0.007(108) | 
Ёз? — (1.007)(1540) 


as compared with the original F-ratio of 11,403/1540, or 7.40. The sig- 
nificance level of F’ will also be altered slightly from that in the table. 

When interactions are likely to be large, and the chief interest centers 
in the test of the treatments m.s. against the interactions m.s., an al- 
ternative approach has much to commend it. This is to compute the 
analysis from the unweighted treatment means given in table 14.1, paying 
no attention to the numbers of replications. With this type of analysis 
the bias disappears, the expectations of the mean squares being shown 
in table 14.9, It will be noted that c,? and c,? carry the same coefficients 


в = 


TABLE 14.0 EXPECTED VALUES OF MEAN SQUARES IN AN UNWEIGHTED 
ANALYSIS 


Source of variation Expected value of mean square 


2 Х-ті 
т, ts fe 2 07У ты, 
reatment т Tto +p @—1) 
oe 
Treatments X places + op? 
h 


in both expectations. The divisor 7, is the harmonic mean of Бе num- 
bers of replications, given by 7, = р/У) (1/7). Moreover, if ap? is much 
larger than c,2/r,, the interactions m.s. tends to be distributed as a 
multiple of chi-square, so that the conditions for the F-test are closely 
approximated. 

It is suggested that the pooled error m.s. for insertion in an analysis 
of this type be calculated from the formula 


By an extension of the results in table 14.9 to the case where the experi- 
ments have different error variances, this quantity is found to be an un- 
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biased estimate of the error component that enters into the mean squares 
for treatments and treatments X places. For the data in table 14.1 we 
have 


82 


ies Sitot 
— t] 224 


TNR 5 


TABLE 14.10 UNWEIGHTED ANALYSIS OF VARIANCE OF TREATMENT МЕАМВ 


Source of variation d.f. 8.8. m.s. 
Places 5 11,692 2,338 
"Treatments 4 10,554 2,638 
"Treatments X places 20 7,158 358 
Pooled error 81 22.4 


Table 14.10, being in terms of treatment means, is in different units 
from table 14.7 (p. 557), Asa conversion factor, the harmonie mean fh, 
which in this case is 4.8, may be used to multiply the mean squares above 
for comparison with those in table 14.7, though this conversion is not 
needed for our purpose. 

In a choice between a weighted and an unweighted analysis, the fol- 
lowing are the relevant considerations. The statements below are 
strictly true only if all experiments have the same error variance per 
observation (or plot), though they should remain substantially true with 
a moderate variation in o}. The weighted analysis is superior for the 
F-test of the interactions. It gives a more powerful test, and the P-ratio 
approximates the tabular distribution of F more closely. It is also 
superior for the F-test of treatments if interactions are non-existent or 
small, since in this case the bias in the test is negligible. The unweighted 
analysis is preferable for the F-test of treatments when the interactions 
are not negligible. As it happens, the results in the example are not too 
well in accord with these statements. For the interactions test, the F- 
ratio is 14.26 with the weighted analysis and 15.98 with the unweighted 
analysis. ‘The explanation of the higher value with the unweighted 
analysis is probably that experiment 1, whose responses differed from 
those in the other experiments, had the smallest number of replications. 
The values of F in the test of treatments were practically identical, being 
7.35 (after adjustment) in the weighted analysis and 7.37 in the un- 
weighted analysis. 


14.32 The Combined Analysis for a Series of Lattice Experiments. In 
the discussion of lattice experiments it was pointed out that if inter-block 
information is recovered, the F-test of treatments іп an individual ex- 
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periment is not exact, because the relative weights attached to inter- and 
intra-block estimates are subject to sampling errors. For the same rea- 
son any combined analysis must also be approximative rather than exact. 

The designs need not be identical at all places. With 25 treatments, 
for example, some experiments might be simple lattices in 4 replications, 
while others are lattice squares in 3 replications. Also, inter-block in- 
formation need not be recovered in all analyses; at some places we might 
have used a randomized blocks analysis. 

If the experiments have different numbers of replications, the pre- 
vious section has indicated that there is a question whether to use a 
weighted or an unweighted analysis. The former is preferable when the 
chief purpose is to test the interactions with places, and when such in- 
teractions are negligible; the latter when interactions are sizable and the 
principal interest is in a test of the average effects of treatments. With 
lattice experiments as used in agriculture, we are often interested in both 
interactions and average effects. But as interactions are seldom absent, 
the unweighted analysis is frequently advisable, and has the virtue of 
being slightly simpler. 

The weighted analysis will be described first. First form a two-way 
“treatments X places” table similar to table 14.6. The entries in the 
table are the adjusied treatment totals, except in experiments analyzed 
by randomized blocks, where unadjusted totals are used. "This table is 
analyzed into components for 


Places 
Treatments 
Treatments X places 


If the numbers of replications r; differ, the weighted analysis follows the 
same procedure as in table 14.6 (р. 556). The square of every marginal 
treatment total is divided by У) ту in order to reduce the analysis of vari- 
ance to a single-observation basis. The square of a place total, summed 
over all treatments, is divided by tr;, where tis the number of treatments. 

The only additional component is the pooled error m.s. In any ex- 
periment where inter-block information has been recovered, we use as 
the estimate of the error variance per observation (or plot) the effective 
error m.s. Ё/. This quantity is obtained by adjusting the intra-block 
m.s. upwards so as to allow for sampling errors in the block corrections, 
and its calculation is explained along with the analysis for each type of 


lattice. For the triple lattice, for example, Ё; is ft + е 1, 


there are any experiments in which the intra-block analysis has been 
carried out, Е; is given the value which it takes when E becomes very 
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large relative to E;. For experiments analyzed by randomized blocks, 
Е; is the ordinary error m.s. 

On the assumption that the true effective variances are unlikely to be 
the same in all experiments, the pooled error m.s. is calculated as 
У nE?/ У). When т; is constant, this reduces to the unweighted 
mean of the E/. Tests of significance are made as in the numerical ex- 
ample, and are subject to the same general criticisms. Since the larger 
lattice designs provide ample degrees of freedom for error, the pooled 
error may be relatively well determined even if some experiments are 
much more accurate than others, so that there is less uncertainty about 
the true significance levels of the F-ratios. 

For an unweighted analysis with experiments of unequal size, con- 
struct a two-way table of adjusted treatment means. This is analyzed 
by the simple standard procedure. The comparable pooled error m.s. is 


taken as 
1/E,! Е Ey 
Инт) 
тү T2 D 


With balanced designs, the method given above is a natural extension 
of the technique used to obtain an approximate F-test of treatments in 
an individual experiment. With partially balanced designs the method 
is more crude than that used in single experiments, where a special sup- 
plementary calculation was made for the F-test. This calculation can be 
extended to apply to a combined analysis, but it is doubtful whether the 
elaboration is worth while. 


14.4 A Test of the Treatments Х Places Interactions 


In this section we give an approximate test of the treatments X places 
interactions for series of experiments where there is considerable varia- 
tion in the experimental error variances. Tt was suggested that the F- 
test, despite its imperfections, will often serve our purpose. The present 
test differs from the F-test in that it gives less weight to experiments 
which have a high error variance, and consequently may be expected to 
be more sensitive. It may be useful in cases where there is doubt about 
the verdict given by the F-test and in cases where the most efficient 
analysis is desired. r 

As before, let z;; be the mean of the jth treatment, and ø; be the true 
error variance per observation in the ИВ experiment. We assume that 
all experiments have the same design, and let 32 be the error m.s. per ob- 
servation in the ith experiment, based on n degrees of freedom. 

For known values of сг the theory of least squares indicates that the 
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mean ху; should receive а weight W; = 7/02. Ina weighted analysis of 
variance of the treatment means, the interaction s.s. is known to be 
distributed as chi-square with (p — 1)(¢ — 1) degrees of freedom. This 
sum of squares appears to be the best test criterion available, unless we 
possess specific information about the type of interaction that may exist, 
in which event a more specialized criterion would be constructed. 

In default of knowledge of the о;2, the natural step is to consider a 
weighted analysis with weights w; = r/s?. That is, we weight each 
mean inversely as its estimated variance, since we do not know its true 
variance. This is satisfactory, provided that the s? are good estimates 
of the 2. Upon examination it appears that the 82 should be based on 
at least 15 d.f. To illustrate the calculations, the test will be applied to 
the data from the last 5 experiments, though the number of degrees of 
freedom, 12, is slightly too low to use the test with full confidence. 


TABLE 14.11 TREATMENT MEAN YIELDS (100 LB. PER ACRE) FOR A WEIGHTED 
ANALYSIS OF VARIANCE 


Pounds Experiment Weighted total 
P10s 2 3 4 5 6 25 шау = Т; 
0 142 180 170 130 170 53.570 
40 176 188 171 140 178 57.000 
80 201 207 188 150 187 62.194 
120 205 208 185 152 188 62.264 
160 217 217 189 156 189 63.932 
ш 0.060 0.032 0.116 0.025 0.096 0.329 = W 
Total (Pj) 941 1000 903 728 912 
wP; 56.460 | 32.000 | 104.748 | 18.200 | 87.552 298.960 = С 
8.8. (S) 180,655 | 200,946 | 163,431 | 106,440 | 166,618 


The arrangement of the data in table 14.11 should be noted. The treat- 
ment means and s;? values come from table 14.1 (р. 547). Тһе weights 
for individual entries in the table are placed in the row immediately be- 
low the entries; for example, шу = 5/83 = 0.060. The total of the 
weights is denoted by W. 

1. Form the column totals Р; and the products w;P,;. Form the 
weighted row totals. The corner total, G = 298.960, supplies a check 
on both the column and row totals. 

2. The items in the analysis of variance are obtained as follows. 

Correction term: 

" G? (298.960)? 


== = 54,332.57 
tW (5)(0.329) ed 
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Total: First compute and record in the table the sum of squares S; 
(uncorrected) of the entries in each column. The total s.s. is then found 
as 
У) (08) — С = (0.060) (180,655) +- - -+ (0.096) (166,618) - С = 

551.33 

Places: A column (place) total has variance to;?/r, and hence receives 

а weight w;/t. The sum of squares is 
EX (Р) — С = 230.09 
This is found readily by using the products w;P;. 

Treatments: Each treatment total has estimated weight W. The sum 

of squares is 
ma Cu (53.570)? ++ +--+ (63.932)? 
т 0.329 

Treatments Ж places: Since this is the only component in which we are 
currently interested, it is unfortunate that it must be found by means of 
the others. As usual, it is given by 


551.33 — 230.09 — 229.57 = 91.67 = I 


The sum of squares 1 does not follow a chi-square distribution, being 
inflated by errors in the weights. It can be reduced to a quantity that is 
distributed approximately as chi-square.* We take 


C = 229.57 


аз n—A4)(n 2) Е фр —1)@—1)%—-4Ф, 
x= ni t 8) I, with (41-3) Hh 
In this case 
‚_ (009 3 090% омат. 
х = 12) (14) (91.67) = 43.65, witl (14) 


In this approximation the degrees of freedom ascribed to chi-square are 
notintegral Significance levels are obtained by linear interpolation in 
the tables of chi-square. Since the 1% levels of chi-square are 21.07 and 
23.21 for 9 and 10 d.f., respectively, 1 is obviously significant. 

The test can also be applied to any component of the interactions 8.8. 
As an example, we will test the interaction of the quadratic component 
of the regression on amount of phosphorus. This was previously tested 
by the F-test in table 14.5, and found non-significant. The values given 


* This test is an extension and modification of a test previously given by Cochran 


(14.6). 
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for the quadratic components in section 14.21 were derived from treat- 
ment totals; to obtain corresponding values from means, we divide by 5. 


Experiment 
2 3 4 5 | 6 | Total 
Фф | —65 —16 —14 —20 -22 
wQ: | — 3.900 | — 0.512 | — 1.624 | — 0.500 | — 2.112 | —8 648 


Each quantity has variance 14 с? /r, and receives а weight w;/14. The 
sum of squares is 


1 (У) =) 1 
— wg? — =) = — 340.89 — 227.32) = 8.11 = І 
in (x 9 ў T ) a 
The quantity Го is the sum of squares for the interaction of a single 
treatment comparison with all p places, and would normally have 
(p — 1) degrees of freedom. To present the general formula for con- 
version to chi-square, suppose that we have computed the sum of squares 
for the interaction of 4 treatment comparisons with Pı of the places, so 
that there would normally be 4(рі - 1) degrees of freedom. Then 
n — 4)(n — 2 t - 1)(% — 4 
2_( )( ) with 1(P1 — 1)( ) 
n(n + t; — 2) та — 2) 
This formula is in accord with that used for the whole (¢ — 1)(p — 1) 
degrees of freedom in the interactions. Since the ¢ treatments provide 
(t — 1) independent treatment comparisons, we would take и = (¢-1) 
in applying the formula to the complete interactions s.s. For 4 = 1, 


Dum 5, 
‚_ 00) 
_ (2)1) 


The tables show that this value has а probability between 0.2 and 0.1, 
since by interpolation the 20% level for 2.91 d.f. is 4.51, the 10% level 
6.10. The earlier F-test gave a probability slightly over 0.2, so that the 
two tests are in close agreement. 

If the experiments differ in size, the weight w; becomes r,/s?. The 
numbers of error degrees of freedom will also vary. This necessitates a 
more elaborate conversion formula: as a rough approximation the aver- 
5 number of error degrees of freedom per experiment is used in place 
of n. : 

The test is not recommended for low values of т, because the s;? are 


(8.11) = 492, with 9. 2.91 d.f. 
(11) 
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relatively poor estimates of the о, and frequently one or two experi- 
ments receive such high weights that they dominate the analysis. Ap- 
propriate methods are presented by Yates and Cochran (14.5) and 
Cochran (14.6). Somewhat more exact tests of a weighted sum of 
squares of deviations, as in the test of the quadratic components, have 
been worked out by James (14.1а) and Welch (14.2a). 


14.5 Repetitions in Both Space and Time 


As has been mentioned, agricultural field experiments are often re- 
peated both at a number of places and for a number of years. We will 
give only an introduction to the simplest case, in which experiments have 
been carried out at each of p places for y years. At any place a new site 
is chosen each year for the experiment, and a new randomization em- 
ployed, so that the data from successive years may be regarded as 
independent. The experimental arrangement need not be uniform 
throughout the whole series of experiments: in fact, a change in design 
between seasons is not uncommon. 

The mathematical representation of the mean of the jth treatment at 
the ith place in the kth year is now more lengthy. In addition to the 
symbols previously used for place and treatment effects, уь will denote 
the effect of the year. Interactions are denoted by multiple symbols: 
thus (77); is the contribution of the place X treatment interaction in 
this experiment. With this notation 


к = p т; ту + Үк + (пт); + (т) + Cn + (ттү)ук + бай 


Note that there are treatment Х place, treatment X year, and treat- 
ment X place X year interactions. 

In experimental programs of this type, it is usually hoped that the 
places and years constitute a representative sample of the population of 
places and years to which the results will be applied. There are obvious 
practical difficulties in choosing places and years that can be confidently 
asserted to be such a representative sample, and sometimes little effort 
is made to ensure that this will be зо. The hard fact is that any sta- 
tistical inferences drawn from an analysis of the data will apply only to 
the population (if one exists) of which the experiments are a random 
sample. If this population is vague and unreal, the analysis is likely to 
be a waste of time, at least from the strictly practical point of view. 

It seems appropriate to regard all the quantities in the equation as 
random variables except the general mean и and the true effects 7; of the 
treatments, because if we could tabulate all the values of say (тт); in 
the population, they would follow some frequency distribution from 
which the values in our data are a sample. For the full application of 
the analysis of variance we require the assumptions that the experi- 
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mental errors and all interactions of treatments are normally and in- 
dependently distributed, with variances that are denoted by use of the 
letters that enter into the interaction. 

An unweighted analysis of variance of the treatment means is ob- 
tained by the standard procedure for factorial experiments. The im- 
portant parts of his analysis are sketched in table 14.12. 


TABLE 14.12 ANatysis OF VARIANCE OF TREATMENT MEANS WITH TIME AND 
PLACE VARIATIONS 


Source of 


variation d.f. Expectation of mean square 
= = 

‘Treatments (2—1) vé ср! + pow? + yoi? + py ж-а 
Treatments 

Х places (t= 1) — 1) và + om? + уса 
Treatments 

Х years (t— 3(y = 1) сё + ош? + роц? 
Treatments 

X places 

Xyeus (4-1)р-1);-1) a? + orp? 
Pooled error oe 


In the most general case, c, is the average value of са /ri, where 
ci? is the error variance in the individual experiment at the ith place in 
the kth year, and rj, is the number of replications in that experiment. 
Consequently, the pooled error should be estimated as the average of 
з/т. 

The method to be followed in testing the significance of the successive 
terms is made clear by the expected values of the mean square. The 
treatments X places X years interactions are tested against the pooled 
error. Both treatments X years and treatments Х places are tested 
against treatments X places X years. 

The situation with regard to the test for treatments is interesting. It 
is evident that no other mean square in the analysis is suitable as a de- 
nominator of F, unless either the treatments X places or the treat- 
menís X years interactions appear to be negligible, so that the corre- 
sponding variance is assumed to be zero. И this happens, the other 
interaction m.s. is an appropriate denominator for F (subject to some 
uncertainty in case the assumption should not be correct). 

When both two-factor interactions are present, the hypothesis that 
all 7; are zero is equivalent to the hypothesis that 


9 + бр, = бр + Oy 
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where the 6 stand for the expected mean squares in table 14.12. No 
exact test for this kind of relationship is at present known. By analogy 
with the F-test, one suggestion is to use the ratio 


82 F вру? 

Sint + 8.2 
where the s? values are the mean squares in the analysis of variance. 
This ratio does not follow the F-distribution, but following an approxi- 


mation suggested by Satterthwaite (14.7) and others, we might use the 
F-tables with па’ and ng’ degrees of freedom, where 


where the n’s are the numbers of degrees of freedom in the corresponding 
mean squares. The analogous expression is used for "2. 

After completion of the analysis, the next step is to examine the nature 
of the interactions and average effects of the treatments, bringing to bear 
any external knowledge that will throw light on the interpretation. 
Often, for the practical uses of the results, we would like to decide 
whether a single recommendation can be made for application in the 
whole population, or whether, on the other hand, it is necessary to have 
several recommendations for different parts of the population. These 
decisions are facilitated by calculating confidence limits for the dif- 
ferences among the means of the most likely candidates. For a discus- 
sion of methods for making this calculation, which can give rise to some 


complications, see reference (14.5). 
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СНАРТЕВ ДӘ 


Random Permutations 


of 9 and 16 Numbers 


15.1 Use of the Random Permutations 


In practically all the experimental designs, the randomization con- 
sists in arranging a set of objects, whether treatments, blocks, rows, or 
columns, in random order. If the number of objects does not exceed 9, 
a randomization is obtained at once from table 15.6, which contains 
1000 random arrangements of the numbers from 1 to.9. For example, 
to arrange 7 treatments in random order, select a starting place in table 
15.6 (without inspection of the numbers in the table). Suppose that the 
permutation chosen is 


18/02 96217455; Sine 


Omitting the digits 8 and 9, we obtain 

1:21 d. a oe 
for the desired arrangement. Similarly, table 15.7, which contains 1000 
permutations of the numbers between 1 and 16, may be used for any 
number of objects between 2 and 16. 

Occasionally the randomization involves dividing the objects into 
groups, as in the completely randomized and cross-over designs. Thus, 
with a cross-over design (section 4.4) having 2 treatments and 10 blocks, 
treatment A must appear in the first row in five of the blocks, chosen 
at random, and in the second row in the remaining five blocks. Select a 
random permutation of the numbers up to 16 from table 15.7, say 


6 16.10 5 127 941215 14 18 1 8 8 


The numbers from 11 to 16 are ignored. The first five of the remaining 
numbers, 6, 10, 5, 7, and 9, are chosen as the blocks in which A appears 


in the first row. А 
If the number of objects to be randomized exceeds 16, see section 15.3. 


15.2 Construction of the Random Permutations 


The idea of using random permutations instead of the ordinary ran- 
dom digits was suggested by Professor George W. Snedecor. Before the 
569 
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writing of this book, 200 random permutations of each of the numbers 
from 3 to 13 had been made at the Statistical Laboratory at Iowa State 
College. These were obtained by first constructing a large number of 
ordinary random digits, which were then transformed into random per- 
mutations. The method of transformation was the usual one: it may be 
illustrated for the numbers from 1 to 11. To obtain a random per- 
mutation of the numbers 1 to 11, a series of pairs of random digits is 
taken, say 
65, 04, 29, 82, 37, 52, 53, ete. 


When we divide each number by 11 the remainders are 
10, 4, 7, 5, 4 (omitted), 8, 9, ete. 


This process is continued until 10 of the 11 numbers have appeared, 
repetitions being omitted. 

Rather than present 200 permutations of each of a series of numbers, 
it was decided to concentrate on two numbers. This decision was made 
partly because any permutation of 9 numbers, say, can be used to give 
permutations of all numbers less than 9. Thus 1000 permutations. of 9 
numbers are more useful than 200 of each of the numbers 9, 8, 7, 6, and 
5. Also, certain numbers, such аз 11 and 13, are less often used in ex- 
periments than others. The numbers 9 and 16 were selected for presen- 
tation because of their occurrence in factorial experiments. 

The 1000 permutations of 9 were obtained from the permutations 
already available for 9, 10, 11, 12, and 13, by omitting the numbers 
above 9. Of the permutations of 16, 800 were obtained from random 
digits. With as many as 16 numbers to permute, the method of division 
as illustrated above is rather slow. A more rapid alternative, suggested 
by Mr. Paul Peach, was used. In this method 16 pairs of random digits 
are taken, for example, 


7% 10 17 28 91 85 74 56 71 06 26 01 06 21 07 60 
14 WG “6! gaiga? 18.039:919:340 & «4 Нет 


The permutation is produced by ranking the pairs in order of size, as 
shown below each pair, In the event of a tie, as happens in this example 
with the two 06's, the two ranks in question may be allotted from an ad- 
ditional random digit. Ties occur frequently (the probability that all 
16 pairs are distinct is about .28) but cause little delay. Although the 
method has the additional merit of using fewer random digits than the 
method of division, with 32 random digits per permutation a large supply 
of random digits is required. The remaining 200 permutations were ob- 
tained by drawing numbered marbles from an urn. 
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15.3 Randomization of More than 16 Numbers 


The most suitable method depends on the facilities available and on 
the frequency with which randomizations have to be made. Navy beans, 
drawn from a box, have been found quite expeditious and convenient, 
though the numbers on the beans rub off with repeated usage. Al- 
ternatively one of the standard sets of random digits, references (15.1- 
15.4), may be used. For example, with 25 numbers to be placed in ran- 
dom order, choose a starting place in the table and select the next 25 
sets of three-digit random numbers. These numbers are then placed in 
increasing order by the method used in the previous section for the per- 
mutations of 16. Three-digit random numbers are taken instead of two- 
digit numbers in order to avoid ties. With three-digit numbers the fre- 
quency of ties is negligible even if as many as 100 numbers are to be 
randomized. If punched card machines are available, the numbers 1 to 
25 may be punched in columns 1 and 2 and the random digits in col- 
umns 3, 4, and 5 of the card. The permutation may then be produced 
and printed by sorting on columns 3 to 5. By filling all the columns from 
3 onwards with random digits, a number of permutations can be obtained 
by sorting on different sets of columns, though the number is limited if 
the permutations are to be kept independent of each other. 


15.4 Tests of Randomness 


Many tests of different aspects of the randomness of the permutations 
could be made. So far as their use in experimental design is concerned, 
the test most immediately relevant is a test of the null hypothesis that 


TABLE 151 NUMBER OF OCCURRENCES OF 1, 2, «++, 9 IN 18T, 2ND, -**, 
Этн POSITION IN 1000 PERMUTATIONS OF 9 
Position 


Nudbe| . a oda тіке 1| Ten 


130 іп іп 108 94 121 14 106 105 1000 
120 118 104 102 102 133 98 108 115 1000 

92 100 16 ш 13 10 16 123 119 1000 
121 98 97 108 17 112 124 102 121 1000 
122 84 113 113 10 1000 
107 118 118 88 13 120 124 111 101 1000 
108 119 121 124 10 94 95 113 16 1000 
115 116 110 98 107 109 117 121 107 1000 
112 17 108 121 12 17 99 103 106 1000 


бө Ea M о Ны. 
© 
[i 
= 
e 
e 
= 
8 
= 
5 


Totals 1000 1000 1000 1000 1000 1000 1000 1000 1000 1000 
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SPOT, 


0001 0001 0001 0001 0001 0007 0001 0001 0001 0007 0001 0007 0001 0001 0001 00017 0001 818301, 
0001 99 69 T9 29 09 99 Li 04 76 Ie 09 +e 89 69 Uu 99 9t 
0001 20 99 69 349 % (09 т 90 69 052374: С-ТА М9 — 89! Q4 $9 er 
0001 02 49 Я 6¢ 83 ec 89 тА 69 b 99 19 or 09 12 29 PL 
0001 99 $9 49 Gg се ГАЧ 99 64 eg 63 ға Lg 29 69 в LS ет 
0001 4^; 98 0 55 90/00 сс 9 89 64-19 90 99 gg yp сї 
0001 МШ 99 % 89 GL S9 % 6% Tg 89 84 ФУ 79 c9 gg ТІ 
0007 м 80 те 19 18 39 29 ec 18 59 09 t9 59 89 69 79 от 
0001 19 89 99 SL $9 I1, £9 6° д 89 £9 A 39 $9 4 ag 6 
0007 Ыс 089 58:-99- os 909^ %4 0р Чу ру 4951 507 869 $9 ғ 8 
0001 9° e9 64 T9 +e 0. #1 98 29 yo 59 89 8% 25 og 19 2 
0001 99 м 9€ 99 §9 164 c9 79 6 9 090 S9 9 c9 09 c 9 
0001 mi 20 89 ш вд % 0 % 7/2 go $9 c8 gg д $9 19 e 
0001 99 19 6c 33 29 95 89 59 E y9 02 99 59 99 49 #9 y 
0001 IG 99 73 99 84 19 бе £L e9 OL тА 19 6% SP 9g 99 ІЗ 
0001 89 89 и 99 60 бо $9 OL e 90. 739 үс. 59" ду Sood ер; [4 
0001 vL 68 99 19 6€ 68 £9 SL 8€ и 99 49 28 0% 02 “oe Т 
9L Пар SIS "SIE HS 0b 6 8 4 9 3 Ф е 2 I шау 
uonisoq 
Эт 40 вхошудпиняа (001 NI NOLLISOd HIQ[ *... ‘ANG чет NI QT <>. ‘Z Ч 40 S39Ng3332290 ао UTRAN Z'S AIAVL 
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the numbers 1, 2, ---, 9 have an equal probability of appearing in the 
156, 2nd, ---, 9th positions. This test is made by counting the number 
of times that each number has occurred in each position. The results 
are shown in two-way presentations in tables 15.1 and 15.2. 

For the permutations of 9, the expectation in each cell is 1000/9, or 
111.1. Each row and each coluran in table 15.1 adds to 1000. Taking 
the values in a single column, we may test by x? whether the 9 numbers 
are equally represented in the corresponding position. If the z's are the 
observed entries in the column, 


peer Pa. 9 BG) 


7 1000 9 


For computational purposes, this simplifies to 
х2 = фус) (е) — 1000 
Thus, for the first position, 
x! = reo (130° + 120° +- ---- 112?) - 1000 = 10.81 


with 8 d.f. Similarly, from the rows we may test whether any given 
number has appeared equally often, apart from sampling fluctuations, 
in each position. Although the two sets of tests are not independent, 
both are of interest. The values of x? are shown in table 15.3. 


TABLE 15.3 VALUES or x FROM 1000 PERMUTATIONS OF 9 


Position Number 
test x test x 

1 10.81 1 7.46 
2 5.19 2 9.17 
3 5.08 2 6.70 
4 17.16 4 7.75 
5 6.06 5 18.90 
6 15.52 6 8.97 
7 8.82 7 8.25 
8 3.88 5 3.45 
9 3.44 g 5.32 

Totals 75.96 75.97 


In the position tests, the individual probability values range from 
.90 (x? = 3.44) to .028 (х2 = 1716). With 8 d.f., the probability that 
the largest of 9 independent values of x? should exceed 17.16 works out 
at 23. Actually, owing to the fact that the cell numbers add to 1000 
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in any row or column, the 9 values for the position x” have a small pos- 
itive correlation which reduces the probability value for the largest x? 
to about .20. The individual probability values for the number tests all 
lie between .90 (x? = 3.45) and .015 (x? = 18.90). The probability that 
the largest x? should exceed 18.90 is about 0.11. Thus neither series of 
tests indicates any marked departure from randomness. 

The total, 75.97, which is У) (с — m)?/m taken over all cells in table 
15.1, supplies a composite test of the complete table. On account of 
the positive correlation referred to in the previous paragraph, the value 
75.97 does not itself follow a x? distribution. In fact, it may be shown 
to be distributed approximately as (9/8)x?, where x? has 64 d.f. Con- 


(8) (75.97) 
e 


sequently we may test or 67.52, as а x? value with 64 d.f. 


Аз would be expected from the results of the previous tests, the prob- 
ability value, .36, provides no evidence for the rejection of the null 
hypothesis. 

The corresponding x? values for permutations of 16 are given in table 
15.4. 


TABLE 154 VALUES or x? FROM 1000 PERMUTATIONS оғ 16 


Position Number 
test ж {ев x 
1 14.40 1 20.25 
2 17.70 2 12.96 
3 11.87 3 16.58 
4 15.14 4 4.80 
5 17.70 5 14.94 
6 7.42 6 9.57 
T 12.29 7 13.98 
8 7.42 8 13.63 
9 24.54 9 15.55 
10 9.15 10 12.45 
11 10.62 11 14.05 
12 16.51 12 18.56 
13 9.92 13 14.50 
14 18.85 14 13.06 
15 6.98 15 14.78 
16 19.36 16 10.21 
Totals 219.87 219.87 


The only figure in the group which has an unusual probability is the 
low value 4.80 for number 4. This has an individual probability of .9932 
(15 d.£.). When account is taken of the fact that this is the smallest of 
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the 16 values for the number tests, the probability is found to be .10. 
The total, 219.87, is distributed as (16/15) x? with 225 d.f. and is found 
to be slightly but not abnormally below its expectation. 

A further test of randomness was made by counting the number of 
inversions for each permutation. The inversions are counted as follows. 
For each number, record how many numbers to the left are greater than 
the number. Thus for the permutation 


348197265 


we record 
0, 0..0. „3 07:2, 5: 314 


The total (3 + 2 + 5 +3 + 4 = 17) gives the number of inversions. 
The extreme values for the number of inversions are zero (for the per- 
mutation 1 23 4 5 6 7 8 9) and 36 (for the permutation 9 8 7 65432 1). 


TABLE 15.5 OBSERVED AND THEORETICAL FREQUENCIES OF NUMBERS OF 
INVERSIONS FOR PERMUTATIONS OF 9 


Number of Observed ‘Theoretical Goodness 


inversions frequency frequency of fit, x* 

0-6 8 6.3 0.46 
т 4 6.0 0.67 

8 п 9.9 0.12 

9 10 15.8 1.84 
10 13 22.1 3.75 
11 36 30.4 1.03 
12 38 39.7 0.07 
13 50 49.5 0.01 
14 71 59.1 2.40 
15 65 67.8 0.12 
16 80 74.6 0.39 
17 71 79.0 0.81 
18 90 80.6 1.10 
19 75 79.0 0.20 
20 80 74.6 0.39 
21 57 67.8 1.72 
22 58 59.1 0.02 
23 45 49.5 0.41 
24 40 39.7 0.00 
25 27 30.4 0.38 
26 27 22.1 1.09 
27 18 15.3 0.48 
28 7 9.9 0.85 
29 11 6.0 4.17 
30—36 8 6.3 0.46 
Totals 1000 1000.0 22.94 
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In intermediate cases the number of inversions serves as a measure of 
the extent to which the order is inverted from the natural order towards 
the reverse order. The frequency distribution of the number of in- 
versions has been studied by Rosander (15.5) and Kendall (15.6), who 
derived from the number a measure of the rank correlation between two 
series of figures. For random permutations of 1 to 10 numbers, Kendall 
also tabulated the exact distribution of the number of inversions. His 
distribution was extended to 16 figures for the test given here. 

For permutations of 9 the observed and theoretical distributions are 
shown in table 15.5. 

The total x”, 22.94, has 24 d.f. and agrees closely with expectation. 
The corresponding value for permutations of 16 is x? = 47.52 with 54 
d.f. and a probability value of .72. 


REFERENCES 


15.1 'ТїРРЕТТ, Г. Н. C. Random sampling numbers (10,400 digits). Tracts for 
computers, XV. Cambridge University Press, 1927. 

15.2 Kenpatt, M. G., and Влвіхатом SwrrH, В. Tables of random sampling 
numbers (100,000 digits). Tracts for computers, XXIV. Cambridge Uni- 
versity Press, 1939. 

15.3 Fisuer, В. A., and Үлтвв, F. Statistical tables for biological, agricultural and 
medical research (15,000 digits). Oliver and Boyd, Edinburgh, 2nd ed., 1943. 

15.4 SwEpDECOm, С. W. Statistical methods (10,000 digits). Iowa State College 
Press, Ames, Iowa, 4th ed., table 1.2, 1946. 

15.5 RosawpER, А, С. The use of inversions as a test of random order. Jour. 
Amer. Stat. Assoc. 37, 352-358, 1942. 

25.6 Кемрлы, М. G. A new measure of rank correlation. Biometrika 30, 81-93, 
1938. 


577 


TABLES OF RANDOM PERMUTATIONS 


15.5 


15.5 Tables of Random Permutations 
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TABLE 15.7 Permutations оғ 16 (Continued) 


122 11 4 16 11 10 25 514 2 14 13 16 6 
6 15 13 10 1 4 13 11 8 16 16 4 3 5 15 5 15 
16 11 15 5 14 14 п 1 14 15 15 13 5 7 n n 16 
$ t5 7 6 2 16 1 14 9 1 3 8 1 6 16 6 10 
7 13 10 16 7 2 12 6 12 6 13 8 935 9 іш 
9 12 з 3 10 5 6 5 16 12 10 15 10 11 4 9 8 
14 10 4 14 13 ва 12 41 5 10 14 16 5 7 9 
10 3 9124 69/8 * 7 3 7 31114 7.3 14 4 12 
Li 8 13 15 13 9 438112 6 9 8 15 14 
247 55 9 9 8 10 11 13 8 5 18 149 31856 
1 6 21 3 8.8 15 71:1 »9'7 8 8 6'2 19$ 
5 9 6 9 4 10.5 8 13 10 7 9 15 2 10 8 4 
13 16 Гез 12 {$ 16 15 2 4.2 HB ox T ond 
15 4 16.1 12 7 15 9 10 9 12 16 4 13 2 10 13 
8 8 14 7 5 10 44 13 12.24 32 "1 16 14 11217 
чи 12 8 815 16 7 6 1146.6 12 10 12 м 2 
4 5 78241 WO ^3 5 1 15 2 м 18 18 1 2 13 
15 12 12 6 15 6 15 6-3 12 5 15 a 46-0 891 
6 7 6 18 15 97518 15 6 4 15 1 14 5 14 10 4 
т 8 15 8 4 12 13 16 9 3 10 7 2 12 S9 м 
14 15 4 12 1 4 10 8 12 1 4 16 6 2 216 7 
п 4 13 5 10 3 14 M209 п 3 9 9 12 12.41 
16 9 2 16.2 15 6 Y ЧБВ 51,8 12 413 6 9 
13 13 16 10 14 13 9 10 7496 5 8-4 1149 
Э ВІ 815 3 ви 275,022 -3 10 10 7 13 10 
3 13 9 9 13 10 1 $ 28.7 "8c9 4 15 15 715 
2 ми 6 14 12 9 10 16 12 13 3 7 5 4% 
1 6 313285 5 14 16 2 7 12 16 14 10 15 5 
5 10 ¥ #1002 02417 1 4 6 16 10 8 8 1 5 16 
10 2 5.7 16 1 16 зии 6 5 1 31 16 '8 78 
8 14 1014 7 n 3 4 13 13 13 П 15 3 6 14 6 
12 16 1.4 9 16 4 12 14 5м 4 T'í 8*2 
8 9 14 14 2 13 1 8 4 15 16 7 6 15 13 13 18 
9 8 10 12 13 14 5 1 10 10 12 9 10 5 16 6 3 
іш 18-41 14 9977 16 5 8 3 11 i2 5 12 5 1j 
3 1 758 135.65 4 3 716 812 8 7 8 9 4 
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5 15 8: 99-32 21:85 10 8 3 14 13 2 110 3% 
п 4 9 3 6 11 12 15 9.7 5 2 84145 
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10 4 1 4 16 6 1 12 16 15 3 16 3 6 16 6 8 & 9 1i 
6 1 6 5 6 1151 7 8 11892 16 10 M 14 2 
1 8 16 16 5 13 18; 6 15) 2 15 6 1%, & д 10 по 8 2 14 
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TABLE 15.7 Permutations оғ 16 (Continued) 
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TABLE 15.7 Permutations оғ 16 (Continued) 


Dg А Телі Se 8 Г ей) 10 8 11 14 16 2 13 
CD gt Rot 52 SA Pl ed bet е Зжб 471 
8 5 16 15 16 15 16 13 15 11 5 12 14 7 15 1 12 
te 02523. 6 2 10 11 13 13 3 6 9 16 4 13 6 
ы 87595 95,9 2, бада За 1 1594 8-46-02 7 10 
1.12 14.14 1 Мый Т ә 59 15.142 1.15.. T 16 5 
14 7 5 11 10 1612 6,446 2 0. 23. 8. 1 6 3 
16 13 6 13 15 115 1 8 15 135.926.5.8 8 14 
BS D В 12. 8 10 2 4 6 15 16 1 1 4 7 
7-6 10 au T Тә T 230216 lO Ta 1,1849. 6 15 16 
S26 1 216 ate 1012 8 7.12 11 221 57,14 .14 14 11 
411 212 5 4 13 4 1 5 8.5.12. 6,,9 9 9 
1010 3 6 9 11.34 4159. 2 4.5,2,32 5. 8 
11 714 8 9 13 47, Жы баб 14 16 15 13 12 12 2 
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е еф Б) 7 12 12 16 3 10 8 612 4 15 7 
T 90-478 5 15 4 3 9 14 15 9 3 15 91 
5 12 6 6 9 15 3 5 13 1 12 14 4 6 16 1 10 
Does 1 216 216 2 7 0010 4 9..4.9,14.9 12 4 
22028 10126 1981 15 16541 1 12 
1525162 үч] ас 64-3 4311 5 1 13 1 
228 9204 271 8 513 5 5 15 16 13 1 8 8 5 
8..4,.2 12 12 4,1.7..8,10 1 9 8 2 13 11 16 
16 16 11 10 10 12. 6 2 6 12 $2454,1.10 1 4 14 
4 i 277121 9 13 6 12 13 6 13 15 13 3 7 15 
10 14 15 3 7 1 4 1 14 10 717116 8 5 3 3 
dé 14m. € 6" 8 10. 1"H 16 12 12 n 10 10 13 
16 4 6 16 9 2 5 11 10 5 14,2 14. 8 8 13 
6 12 12 2 15 16 3 14 9 1 8.5.8 9 9 14 8 
13 5 14 7 12 8215450,1 13 6 9 5 1 Ж”! 
8 216 6 13 6 12 7.2 12 16 9 14 1 13 5 16 
492,9 712 216. 1 6 16 13 3 и 16 3 10 6 13 15 
10 218 210 21 221 5.4.8 1 15 12 15 1 4 7 7 10 
14 8 8 15.8 9 9 6 S8 16 15 13 10 8 il п 3 
1101113 2 4,T,29.7 дд 6 10.1.7 2 16 6 
а 2 ad ааб 8 16 15 14 7 2 1212 6 5 6. A 
4 10 28 20 127 13 8 1 3 6 14 3 13 15 12 15 14 
„+ ЕР. 4 14 15 3 15 10 8 8 16 2 10 10 11 
T 4&8 4.44 Д 1113 5 8 5 1 7 16 15 6 12 
Жж os ak Des] 7 10 10 12 4 2023.5 11.3 25 
15 14 13 8 4 30 12 55 469 9 7 4 12 16 з 2 
9 а 20 12 13 91 9 4 4 15 13 14 то 
Сыды ыл 15 714 “12 "4 14 Г: тре UM % 12 4 
15 9 15 13 16 410 515 4 60 6 4 7 9 6 
T LE. Д m qud 3 10 1 14 11 8.5 9.5 6 13 4 
12 10 1 11 14 1 547 Al «5 40 13 4 14 11 2 7 14 
i974 9 д 21 1 8 2 16 15 4,6 13 7 12 14 7 
4024 40 29 а 24 41.9 42.83 15 8 M 10 15 15 5 
2 15 8 6 10 15 5 3 7 12 1x3 да 16 -1 5 13 
В 23 gh ра 10 12 16 10 1 2 4 7 14 n 4 11 
14 12 14 10 7 8 4 10 1 14 16 1 12 1 16 $ os 
11 1 13 15 6 16 14 13 i3 7 14 13 3 15 5 2 15 
5 5 В LT pe 12 9 12 913 121 1 13 4 1 10 
18 7 2 2 2 3 6 6 2 6 21 49 48 «4939 .8 ^ 19 
9 28 oe в 6 16 7 4 9 5 2 10 12 13 п 2 
се И 2:5 7 2 15 11 16 10 7 16 3 14 12 16 
6 1 6 16 11 5 314 6 8 9 16 15 9 8 16 1 
$9 5 16 м 8 9 13 4 8 5 815 4 8 9 3 9 
4 16 5 12 4 215 8 3 2 т 12 5 6 10 10 12 


erdum. aia. 
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TABLE 15.7 Permurations оғ 16 (Continued) 


15 10 15 13 3 
142% a a oe Be he @ 6 10 5 16 
вы 3 is ba Yo da o 5 A 12 6 2 13.2 13 15 4 
{УЗ PPR didi BRAS 
; 5 3 11 10 
i251 Н BESSP шалыс 
2 56 4 11 16 6 ce 
iP лаз тю 7 612 15 нити 
16 15 9 uu 
254% 6 14 11 13 2 6 1 10 24 Тү 
2 3 13 ке 
ЖҮ 91316 3 1 10 16 TCU 
15 14 4 өт 
1$ < 2 15 12 4 7 137 DU s 252% 
cw eas июге 33813 от б 3 16 ва 
вит 2 95 55420 5 08 6 1 15 11 15 11 11 м 
3 
à s 51130 йын EE 6125 Бин 
3 
18 6 6 11 10 8% 54 55 20 зз, 
зілі Hagen ИИСИ cages 
i id 18 y. 15 1 5 13 10 3 * ИГҮ 
зо 8 9 12 13 КЕЛГЕЛІ 
Aer 81114 2 2 
ЖЕБЕ Байны BM DUI Lieu 2 
tap В 16 1 2 4 1 2 и 1 1 
13 4 4n 
ə Р 5 8 $ l4 16 4 a 18 + ка бо er 5 8 2 1 в 
ш A HE 3 7 12 
16 14 7 15 8 GR м № 15 $3 2 23 : 
Е зи вю 19-21 a» 27 
2 Ul 7 5 2 10 10 55.32 
а СС 10 11 11 12 м H 4 10 
11 15 16 9.282. 05 
12 15 13 12 16 9 55 de Lo Mo 2 6 s 4 
d № 3 15 12 16 
i x vu 6 8 10 16 12 3 15 M 
i ta 8 7 35 5 
15 4 2 10 8 АБ us bs DS DM. 5 TT E : 
Жү: 10 21 05 w4 Б PT т: аз a 05.06 18 4 10 18 7 
T PI 1 15 14 15 10 (fee иа 
EE з аз азм 18 Nek 4 NS 16 1 3 4 19 
o di 16 13 13 12 12 12 14 4 18. 4 "9 қасып 
MT. 5 ae 535 3. шіл 
У "Ed 0 — 16 7 10 12 
DENM 410 6 3 1 свет 
5 10 3 12 18 21 1 9 5 4 Mo Wa aono КЕКЕ 
7 DY B cy 1 15 5 4 Я 1 16 10 13 9 1 5 вжи 
de 4 1 5 12 11 10 14 ЗА A se T4 ТЛ 06710 
3514 шщ ии n HE 24443 
5 16 4 E 
Ub TM T1. 75417 5 ваи 
: : ; Mos 4 в 6 1 о ЕЕ 2 11 411 9 10 и 
BHIBIHIBIHT EERE 
р n 
11438 (3022 шаага 
7 716 10 12 10 10 16 6 и 49 99 85 Yi He dH 7а з 
15 14 1 16 6 то "s Ш 18 13 12 5 A ed 
TIE з 6 1316 6 rig a ЖОЛЫ 5 12 12 2 4 
2 8 10 6 4 D é 51860408 
18 d 2 2 1 9 13 4 15 1 25%. ; з s " 
12 
; 25555 нонии бшш 
sidus dm м Msi ET o 26 
EEE 63 01 LE Do E 42 v1.11 02 ye 
D 25:2 do жиш 19 oa мавз 0 м lug i 
16 7 6 2e НЕ TE 
чоп п E B E 8:55 9 i 5 3 1 р 2 i 6 16 Н 1 
4 11 1 8 15 4 13 10 2n Бе 
HERO. mus o вит в mig 6 1 
15 12 16 10 8 2 1 16 6 10 т 6 4 8 < 9 06 : 
агасы. 8 13 3 1 10 nou зл з 49245 
7 Jo 7 14 15 11 13 , 16 5°2 d4 M1 1 aie 5з i 
9 8 24 6 14 2 12 6 8 16 6 I @ bio pis H3 ji 
Б 24.1 13 3% 10 14 4 4 9 71 5 5 "NIME n 
205 4755 шесі e BE Sete 4 ih 
21014 5 5 759215 чы к 
É 1 3 в ч 13 16 5 516 


590 
536 
3 6 10 
9 13 3 

15 10 14 

10 2 8 
212 2 
1 8 13 

14 5 5 

ВЕТ 
6 14 1 
ұғын ЖЕ 
8 916 

11 15 

16 11 9 

12 15 12 
4 126 7 
211 6 

11 15 14 
9 14 12 
8 133 
8 110 

14 3 2 

12 4 11 
Mut 

10 12 16 

16 2 7 

13 10 9 
7 16 15 
4 5 4 

15 8 8 
5 9 13 
6 6 5 
516 1 

1415 8 
4 13 14 

11 10 9 
3 6 10 
8 9 16 
1. by xy 

18 8 4 

16 12 11 
714 6 
6 4 2 
Я Ру 58 

10 2 5 

12 11 12 
9 3 13 

15 5 15 
1.5 8 

15 6 13 
82717 
8:79 91 
T VIS. 2.6 
5 2 14 
9 12 9 

16 71142 

11 14 12 

14 8 11 

10 10 4 

13 1 16 
6 4 6 
2 15 15 

12 16 10 

4 R 3 
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TABLE 15.7 Permutations ОР 16 (Continued) 


Vaud ЖО ЖОНО PE 1014 8 9 9 9 11 16 16 13 
13 4 4 14 1 12 10 Тиек хе? gs 6/22 2755554 
12 16 ИЕ ed de САЛЫН 10 77-33 1% 
2 13 211 2 15 15 Cat fab НАН, 16 16 9 9 7 
6 12 13 15 11 16 8 3 6 3 1 15 3 9 12 8 10 
5 10 6 10 8 11 14 6 13 10 7 5 14 13 2 13 8 
3 15 25.2 2189, Тө 51 DD Ce ba 12 14 8 10 5 
8 5 12 8 10 13 9 16 2 6 16 2 315 6 3 fy 
41731 16 73.6 812 410613 1533: 5e 2 зии 2 
15 7 1 6 4 10 6 15 16 11 15 M 13 10 6 15 12 
м 9 5 6 6 3 12 14 10 12 2 10 T 4 3 3'3 
9 6 11 023^ T9 y s 8 9 9 4 13 8 810 7 9 
1 u M 111 2 7 8 14 10 1 5 113121 
10 14 3 712 25 18 15 7 13 12 15 5 4 4 15 
16 8 7 13 16 6 3 11 15 51 4 6 1 M 16 
4 8 9 415 5 4 512 1 14 6 1 9 157604 
14, 1 910 9 5 5 8 8 6 7 Б 14 15 7 8 7 
ДАЈ 15 18 8 4 2 11 38 277071 8.12 5 11 11 
11 10 12 7 12 16 M T 35 1 8 14 4 6 8 М 15 
15 2 8 12 10 9 12 14 9 4 5 12 5 13 16 4 1 
8 18 10 1 2 14 4 10 10 13 1 10 6 2 12 16 10 
9 п 7 4 14 18 15 18 7 16 15 2 12 3 11 12 16 
2 15 614 7 8 6 16 1 15 13 16 8 T7 10 13 6 
10 5 16 16 16 10 16 АСЫ "3 33 18 1 13 15 18 
4 9 14 6 112 8 51 516 9 9.4 9 T7 X 
1 6 i8 2 5111 3 16 12 12 8 16 9 14 6 8 
6 4 5 9 6 15 13 412 8 4 5 116 4 5 8 
16 12 4 *8 <8 *9 ^T 6 4 210 3 10 10 1 9 5 
53 1.98 18 2221 1141101 4 91 210 2 
18 16 38151 7 9 15 210 6 7 тм 6 2 14 
8 14 151 33 9 6 9 3 18 15 5 3 1 12 
12 18 211 4 6 10 2 5 11 м 6 n 815 3 9 
8 13 8 15 4 10 8 15 4 15 16 8 з 8 14 16 1 
7 4 6 8 14 13 9 6 14 14 15 16 16 6 13 14 M 
4 10 13 7 2 8 4 101 1H 6 4 3 16 10 12 
15 16 3 5 5 12 16 16 12 13 6 5 6 16 15 13 2 
nu 7 11 11 12 16 13 T poe 19 us 10 14 7 15 
3 6 1335 755 58 2 9 110 4 М 41 817 
12 58 4 12 13 6 15 14 2 52 3 15 15 12 1 8 
28 2 6 10 4 7 12 3 3 1 10 113 3 9 10 
13 2 45:9 51:7 0d ҮРӨ ag 2 n 1 e 71 
6 1 12 зим 6 п 5 16 14 7 2 510 2 6 
1 15 5-116 2 5 9 8 10 3 M 2% T Жл. 
м 11 10 9 15 n п 4 15 2 12 12 9 10 1 15 13 
16 14 710 9 1 1 8 612 8 1 5 2 51 9 
5 5 16 4 взи 5 13 6 71 13 12 9 4 16 
9 9 14 14 3 15 2 310 9 4 9 8 9 812 5 
10 12 9 16 8 9 10 13 16 4 5 15 ти 2 6 4 
15 4 947 в 8 i$ 3i 8 6 3 9 1 15 12 M 
6 15 з 115 3 6 10 4 15 9 9 13 2 5 15 16 
12 16 265 79 2:9 12 2 18 4 16 7 13 8 
51 5.11 3 10 1 110 1114 6 15 10 14 6 
8 13 8 зат nm 16 8 3 13 1 10 7 8 11 10 
9 ўа 10 5 1 11 1 5 14 16 16 6 2 1 9 10 1i 
1 9 14 4 16 16 4 14 3 4 14 10 2.4 12 74 
2 п 13 8 9 1 5 6 е2 29 €5 n3 1 a2 8 116 1% 
3 6 6 3 7 5 14 1 93 22 53 28 15 13 13 2 3 
4 14 412 2 2 15 9 (13 «6 ЙЗ «1 3 311 3 18 
1 18 12 9 10 13 11 а тт 3 qii 7 1014 6 2 
7 10 16 16 8 15 10 4 5 515 4 16 14 1 9 1 
16 12 15 14 4 6 3 8 6 9 10 12 п 8 4 115 
43 7 1 15 14 12.2 7 16 10 12 15 8 516 5 5 
10 5 8 13 13 4 13 15 1 11 4 16 5,6,6 „8 29 
14 8 7 10 12 9 16 12 15 14 H 5 м 9 2 4 12 
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4 8 
2,15) 4 Н 
1% 16 1 
5 16 1 HET : 
шу за 15) 16 121 ізі! : 
А : ов 9 15 15 19 be 
iH : i ЖЕ TEL H НЕН 9 13 12 
1 T 1 ; 76 18 7 16 
H I Е It А E 414 EG 2 11 
Pare 5 mn td Зав 16 16 141802 i 
it Pa НЕ 7 10 la 5 4 10 18 7 ER m 2 
12 16 4 4 18 i5 Ta H 
I Lid s 5 н 3 10 14 
X з 10 7 2 13 6 5% ar sh HE 
EE : : E ned 6 5 10 16 
i Ir $1 0 1 9 13 
2 3 15 13 w ора e s 2442 {4 
SIE Pi "Au 7 16 5 10 ш Ме 
п i м1 
1734358 16 1 гы * ү 
i uiu 12 її kig 
Idi : бй тй 14 11 10 Pi 64 
8 14 a 
1 4 13 b E НЕ HE 
16 8 16 4 n а 
6 12 15 i^: 2 5 в iii 
x y 10 2 18 14 
18 1 0 16 2 ( siat I 
1 5 4 
11; 252 53 7 155 ві шю Ін 
E { E: ERE 8 15 18 £ 
$ x 16 10718 105134 45-553 aie ae TIE 
$ i $ 31 азы up. E 
E rE Н в 21 14 16 
$ 4% РУ 16 10 15 3 ш Ps 
ч 4 15 12 9 15 ТЕ o5 в Pda 
2 H 8 4 
в ап 2 5 87 z $ 
fi отин в НЕГІЗІ Нин 
ии 14 16 7 Ims ZI : Zn 
n : 
TEER 1 0 16 15 ТІ it 1 
$ i Y. 16 1 
17452 Е е; 3 12 12 ipu р 
16 11 H Pas Hs 
jw ls к о sie ii | 
E i 5 13 mE 
12214 75 8*4 1 мя ты р 
16 is 
EERE 1 6 7 14 (o. ы $5 
з 3 3n 
4 10 12 1 Т HE FE | 
: Wis f 10 10 
14 12 8 E ІШ в ii | 
: - 12 12 8 
1 2 10 3 E di "e | 
2 16 Y 2% 27% 
n 6 18 E қас T T 
Ws 2и | ee Т 13 15 i 
die 6 16 1 ІР ва Ws 
9 13 10 8 14 Mri iz ов E T 
; E H 12 10 are 
ви 8 : м A | 
Hol УЕ E * H 
EERE 0 12 16 12 m 3 E 
з 3 5 M 572 
7 7 8 8 16 9 3 9 913 Е i | 124 
ФЕ в ГИ ө тим tns 24 
47% ; 16 16 Oy 
idi 2 зю iS nu 11 7215 
Е : ta 4 
16 2 13 8 8 ET р Ы 
: 2 1 10 4 BE 
КЕ 5 E t a n 13 12 4 P. Bn 
pt S 0 : 4 11 TU 
i NE 12 ур 82% 5 irs TH 
hd 5 Оа зуу PIPER 4 pr 
12 1 в qs w 4 14 и ^ 2 1 5 2 в 13 ii 
8 15 n s 
16 2 at в Ж | 
nos Baie ait 416 6 1 nucum TENE 
Mee E in 10 65 1l 1 2 во 
13 11 а sz ч ы 
Ae 1m 8 2 д 1 8 M 16 8 12 3 
ii f TERE 8 2 10 ве n ча 
2 13 п 2 3 31 3 181161 45 8 4 EE 
à 1 ЖЕ 1 13 56 
i 1% 3 6 10 pid $05 вв i ya 
о : 5 HE 6/71 54 5 10 13 11 14 
15 2 1 i’ £ 
16 14 15 ELE 2 $5 T. i Ё 
м 2 во E : 
т piy 2 5 2 71 LIH 3 ® 
16 1 RE 9 НЕ £ i 
+ З T 6 13 Е. 16 1 о i т n Tit. à 47% { 1 Н 
5 8 9 9 215 81516 nol б вы На E 
IRE вво 815 $ 419 18 1 О a 
Е Аы. Tx dn 1 10 3 2 11 6 12 11 
4 3 10 5 4 4519217 вв 2 зин 
трави и 1з и 15 м F 
u bB 16 2 15 зари 
12 7 
1: 2 5 
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TABLE 15.7 Permutations оғ 16 (Continued) 


10 9 10 9 4 4 7 2 5 16 6 13 15 11 16 16 10 16 12 15 
1 5 5:5 11 5 18 10 2 6 16 4 15 14 8 12 18 n 
€ 4 7 M9 9 15 13 7 13 $ 814 5 8 іш ry 734p 
9 6 15 6 10 8 614 1 5 15 9 4 8 11 9 1211 6 9 
5 10 16 13 8 2 3 1 4 9 п 14 5 2 10 3 п 15 3 2 
18: 2 6112 2 14 16 9 3 4 1 э Жш 7 13 14 14 15 13 
it sa) 5:6 5: 93! 8i 14: я i$ 3 4 £ з 12 2 5 7 
5 14 14 10 16 15 8 312 6 4 15 10 1 14 5 15 1 1 6 
Zi 81131 307 8 6 411 6 15 51 3 6 1 10 4 3 10 16 
8. 8 IR it if 12 9 4 15 8 8 10 7 12 13 49 61 7 
1 15: 3 16 15 3: 5 12 16 2 12 16 6 14 6 8 13 10 9 8 
8 1b 181 2 12 7 1 6 914 9 5 13 16 4 T 818% 
16 13 8 15 1 10 12 15 2 8 16 4 8 3 3 6 5 4 16 4 
14 12 9 14 14 1 10 16 10 1 fan ош 15 1 9 5 5 
3 16171. 18 13 13 13 1 1H n 14 12 12 13 9 1 16 2 2 10 
8 5:4 5 2 16 14 10 8 12 13 7 9 10 5 26 8 à 3 
6 5 8 12 16 171718 8 4 6 12 15 5 1 9 5 16 10 15 
16 1 8 18 14 161 41213 7 16 14 7 14 7 7 16 15 14 2 
10 12:17 7 5 115 3 9 5 6 9 15 4 1 8 14 12 18 
14 16 2 16 15 13 12 8 10 1 14 8 10 11 12 5 15 11 5 
3 10 10 6 3 1: 8 4 5 8 И б 8 11 13 7 3 4 10 
5.14 9 8 8 4 2 16 14 13 12 10 4 10 16 11 12 810 
15 13 16 10 9 8 9 6 12 14 15 1 6 16 8 4 6 13 1 16 
1 3 13 15 4 116 5 1 8 7 13 12 8 9 15 3474 
1 8 6 2 10 и 5 915 6 8 2 16 6 5 16 4 7 5 7 
12 4 15 M 1 2 3 9 ши 9 9 5 з 8 10 2 5 15 M 
T" jd 4 9 13 10 13 8 11 12 П 4 8 4 2 & 9 8 9 8 
4911 2 315 7 9 6 10 15 11 9 6 3 12 6 2 12 
dtp ва 9 11 10 4 2 3 n 1 13 13 $ 15 3 $9 Ж 
18 15 1 1 12 14 10 11 16 10 4 16 14 2 15 6 1 9 16 6 
9 412 35 6 14 1 7 16 2 3 18 12 14 14 3 10 13 1 
8 25 5 6 16 6 1 2 15 з 7 2 7 10 шипи 6 9 
i8 5 аи п 12 9 9 16 12 4 82 2 5 19 3 8 10 
8 14 5 2 10 1 баз т A4 183 2 15 15 11 4 6 
1 13 12 14 12 15 7 15 13 3 8 9 14 10 16 5 4 12 2 18 
4 15 М 8 13 5.6 7 6 5 213 3 9 7 и 7 10 7 % 
3 12 EN 6 6 м 4 8 141 12 8 7 16 1 14 5 16 10 9 
16 10 9\10 8 6 211 4 16 1 15 11 1 3 6 6 612 3 
1460 te 5 215 10 3 4 3 14 15 1 14 16 10 2 зи 
9 16 10 5 2 13 14 12 15 11 14 7 16 11 9 9 3131 1 
5 4 8 7 9 и 90 9 T 15 12 9 13 6 814 5 13 2 
19 4 іші 8 12 M 11 10 16 16 1 12 n 13 M 14 9 12 
$i Y 38 и 710 4 8 9 6 10 12 5 15 4 12 1 14 U 
14 8 M 15 7 10 16 13 12 15 51 8 7 13 1 16 8 6 16 
1511 6 в 3 1 1 16 1 13 9 110 8 8 з * 9 1 X 
10 3 8 9 14 9 13 2 5 6 10 2 6 15 10 10 13 15 5 8 
6 9 2 13 15 А 1:10 1 1n $ 4 4 4 3 1 415 5 
2 2 15 16 4 16 3 5 7 8 13 6 5 6 12 7 8 7 16 15 
9 12 16 8 10 7 4 15 5 14 и 3 8 15 13 16 14 1 1 15 
16 13 1 12 12 14 3 16 6 10 16 1 2 1 15 1214 3H 
€.» T 8 à Ж 6 13 13 4 2 5 18 13 16 7 29 9 15 5 
7 15 9 5 18 249 21294-27 10 15 3 5 10 4 13 8 16 3 
10 2 12 14 3 и 5 813 8 Ekinn 10 5 13 4 1 
11 16: #! & ® 16 12 14 16 13 т oz а з 12 12 10 7 2 
14 10 5 13 8 10 7 6 12 9 9 6 12 4 12 з 7 4 8 13 
à 8 85 Y 15 15 H 2 nu 12 2 15 2 16 T 8 12 10 12 
4 9 2 6 1b 210 8 9 6 елаз 2 9 16 тм 8 
8 4 710 6 8 10 4 8 12 814 6 3 9 14 1 5 6 16 
15 5 15 16 15 Ж og жоу 310 5м 6 15 3 15 1 9 
з 8 10 11 9 13 14 12 1 5 14 9 10 10 8 6 6 6 91 
6 114 3 14 3 6 5 14 16 3 7 9 6 4 13 4 11 13 4 
3 6 4 15 7 9 1 9154 15 16 11 7 14 5 15 16 2 7 
J9 11. 11 1'16 1 8 10 10 3 4 4 16 13 1 810 3 12 6 
5 14 13 3 2 4 9 7 з 15 2 81412 5 їп 2 5 10 
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TABLE 15.7 Permutations оғ 16 (Continued) 


io зї %/ @ 4 10 1211 4 10 16 6 1u ап 10 6 5 10 16 
1 6 2366 Hs L Bg 6 3 1 12 16 H 341 5 
6 10 11 15 16 14 15 10 H 9 5 15 15 8 6 14 5 12 15 38 
11 3 16 1 12 15 4 15 15 6 4 210 2 9 4 11 15 16 10 
7 16 15 10 10 11 13 13 7 4 101012 9 3 уа 32 2 
А4 8 2 € B5 6 11 16 12 1 ы т 9 ub 4 16 2 8 5 4 
14 4 8 5 3 4 5 14 1 12 12 11 4 10 12 7 12 M 14 14 
3 9 14 11 15 9 16 7 816 5 Z2 316 Т 1814 3 3 12 
2 14 5 M м 7 183 514 3 12 13 3 14 #1 10! Y 41 
9 2 3 3 18 276 2 9 % 447 7 3 i4 19 7' 18 7 
12 13 13 7 7 8 6 914 8 7 16 6 513 9 8 9 12 11 
15 1 9 & 1k 13 10 8 10 15 14 13 3 6 10 & 13) 1U T! 4 
15 12 6 4 4 5 & 1$ 10 4 8 5 5 13 5 8 210 6 6 
8.1512 8 2 16 9 5 2 13 $ Vi ид 6 4 2 9 15 
5 8 413 1 19 3 £X 18 5 18 14 16 15 15 8 10 13 1 9 
13 7 1012 9 2 8 6 6 5 18 8 4 8 15 9 6 813 
11 7 16 10 15 $ Lum 1 3 15 4 814 8 15 1 5 16 6 
+6 T 6 Ж 7 6 67 15. of 1012 ви 7 4 14 15 2 10 
TESE 11 зи 16 14 2 15 9 11 12 M 13 7 
0° 1 M | 9 $ ТЕ 8 7 15 5 1 7 13 12 9 13 
12 8 7 16 12 $ 37. 19 8 1 8 10 1 16 $ 41244 
6 15 4 8 8 5 9 13 9 10 69 415.14 % 19 2:8 8 
9 11: 0 15 4 8 2 4 6 18 5 3 413 6 8 8 9 1 16 
13 18 3 4 1 м 412 51 Д 5 Oh Wat 3 10 6 6 15 
14 4 11 14 4 10 5 10 12 8 в 1$ $ 9 4 12 9 n T 3$ 
5.5 2 9 9 13 10 9 13 15 2 15 1 10 15 2 15 3 10 5 
4 14 8 3 16 4 16165 4 t 4 214 4 5 10 8 2 8 15 
712 512 5 16 15 5 14 6 9 6 16 9 14 18 9 4 15 мМ 
8 29 2 14 112 8 4 4 13 11 13 16 1 6 11 16 11 8 
16 10 15 13 10 15 14 7 10 14 18 10 8.5 3 AA 5 T $1 
15 914 1 2 i13 18 15^ 8 $ 12 9 12 6 10 9 16 10 12 9 
2 13 13 7 13 Ww 39 8$ F 14 13 5 3 12 Y 6 1148 

зи 1 16 5 13 1 M 9 f£ т вг 11 101! + 
1 4 6 11 12 8 12 16 11 3 T E м ni 10 10 в ; 3 T 
инан инан Sadat ТЕ 
10 2 7 2 10 we 31 T 9 14 5 15 Н 19 H Ton d 
15 13 12 12 5 gt 2 ^0. 3 п 9 9 12 Н 
8 1 2 13 14 12' 1' 127 12' 8 6 4 104 L 4 11 3.18.9. 2 
іш $ 3 eru 3 в H Н 7 15 8 181 4 1 
14 10 5 3 9 13 8 6 16 
3 16 1 15 2 15 4 H у B T 1 3 15 4 $ $ 2 n n 
nid s t4 6 9 
6 7 9 10 1 taf 3 23 16 6 8 : т. 10 А $ 1 r 
13 6 14 16 16 10 5 1 $ T и % 4 16 { аа 6 
I 5 15 6, £f 16 1 
15 10, $' 8 7 3 5 15 4 10 4 10 10 20 2 1 M d ч x 
2 12 10 6 8 т з 8 15 

з 4 
8 2 1 14 10 à 2 "H3 5 е1 2 г 3 11 2 32% 
6 10 2 2 6 1 9 18 7 12 1E ЖА 4107 3 
4 15 8 49 8 7 

10848 10 4 7 4 ний 1 Ip iiid 
5 3 14 10 9 S" CRN + 19 10 1 AER 8 1 1 16 3 
10 15 7 16 13 12! 13. 45216 № 4 5 574% i3 8 13 14 9 
12 16 8 11 12 511 5 12 1 8 тірі 18 мі 9 1l 
16 1218 5 7 4 8 14 16 13 3 4$ 8$ 15 3 4 12 13 
Ik Б oL Е з 2 12 10 8 3 3 i14 10/7 5 16 8 10 1 
2 96 10 1 16 4 5:8 T, 9 1: $116 19/14 6 10 7 2 12 
24 6 19110 16119, 9f 0^ 7 5 E tee 3 34 pu 
9 11 1613 8 115 4 14 1 14 10 12% $7586 
i 3 9" = & 9 10 16 516 1 12 191 81 33 11 1614 4 M 
4 1 11 15 15 11 69 5 16 13 Ц ив 
пан ниша ваш ишу 
14 14 5 3 11 ee 1 
7 7 15 6 14 1578314 19 $9741 16 1 


Bones SH кь 


-- 


- 
ao 


LITE 


12 


Beon Eem 


коо мы 
ЗоотьФоч 


16 


RANDOM PERMUTATIONS OF 9 AND 16 NUMBERS 
TABLE 15.7 Ренмотаттомв оғ 16 (Continued) 


т т 13 6 15 1 12 11010 7 5 з 9 
4 16 10 2 16 16 3 13 12 1 3 15 1 15 
0 9 1 5 10 9 13 7 15 13 12 4 11 16 
10 12 813 6 2 6 9 9 15 13 1 5 12 
8 13 915 4 5 5 14 5 4 4 2 8 10 
к 71244 7 8 16 1215 10 4 5 
12 5 110 5 14 2 m 92: 6; 3 7 12 13 
5 3 М 8 112 8 514 8 5 3 15 1 
15 10 16! 8 11:10? 1 15 8 14 812 10 11 
16 8 12 4 2 7 10 4. 2 т 4/16 2 4 
14 15 6 1 3 15 M 411 $ 31 6 T $ 
13 14 5 14 12 13 9 12 7 9 1 18 9 8 
ТЕ 4 15 7 13 3 16 8 4 2 19 6 14 
2° 8 3.16 7 8 4 6 13 16 9 14 14 3 
6 2 2 9 8 11 15 1 6 6 16 11 16 7 
з 6 41 9 6 1 16 1 10 10 8 13 2 
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t test for comparing two means, 101 
Hyper-graeco-latin squares, as fractional 
factorials, 274 


T and J interactions, 193 
Incomplete block designs 
balanced incomplete block designs, 
376 
balanced incomplete blocks, 377, 
439 
balanced lattice, 377, 396 
incomplete latin square, 378, 507 
lattice square, 377, 483 
choice of incomplete block design, 388 


in sequential experiments, 265 
plans, 276 

principal block, 267 

randomization, 267 Я 

rules for constructing, 245, 247, 249 
statistical analysis, 253, 268 


arranged іп 
extended incomplete latin 
squares, 391 
incomplete latin squares, 391 
lattice squares, 391 
РЬ designs, 392 


table of 2” designs, 261 

use in practice, 259 

use of hyper-graeco-latin squares, 274 
2” series as first order designs, 341 

3” series, 271 

4” series, 273 

3 X 2" and 4 X 27 series, 275 


randomized incomplete blocks, 
390 
comparison with 
random controls, 387 
randomized block designs, 385 
split-plot designs, 388 
Systematic controls, 387 


INDEX 


Incon:plete block designs, composite sec- 
ond order designs, 353 
partially balanced incomplete block 
designs, 378 
partially balanced extended incom- 
plete latin squares, 507 
partially balanced incomplete 
blocks, 379, 453 
partially balanced incomplete latin 
squares, 518 
partially balanced lattice squares, 
484 
partially balanced lattices, 378, 403 
statistical analysis 
with recovery of inter-block infor- 
mation, 382 
without recovery of inter-block in- 
formation, 380 
Incomplete latin squares, 378, 507 
extended, 507 
index to plans, 520 
missing data, 512 
partially balanced designs, 518 
plans, extended, 538 
plans, r < t, 522 
plans, г > t, 536 
randomization, 508 
small numbers of treatments, 513 
Type III, Е = it — 1, 513 
Type IV, k — it +1, 515 
statistical analysis, 508 
Type I, with recovery of inter-block 
information, 509 
Type П, with recovery of inter- 
block information, 510 
Type Ia, repetition of Type I, 511 
Type IIa, repetition of Type 11, 512 
Type V, with recovery of inter- 
block information, 509 
Inference, statistical, 3 
Interactions, 150, 153 
confounded, see Confounding 
generalized, 187 
used as error, 218 
Interpretation, 1 
Intra-block correlation, in balanced in- 
complete blocks, 440 
Inverse matrix, 337 


Lack of fit, 340 
Latin squares, 117 
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Latin squares, completely orthogonal set, 
139 
efficiency, 127 
for estimating residual effects, 133 
missing values, 125 
plans, 145 
randomization, 121 
replications, number of, 120 
row added, column omitted, 516 
row and column added, 516 
row and column omitted, 516 
standard errors, 124 
statistical analysis, 121 
Lattice designs, 396 
balanced lattices, 377, 396 
missing data, 400 
randomization, 405 
statistical analysis, 396 
effects of errors in weights, 399 
in series of experiments, 559 
index to plans, 390 
partially balanced lattices, 378, 403 
arrangement of experimental mate- 
rial, 404 
four and higher numbers of repli- 
cates, 404 
missing data, 413 
randomization, 405 
repetitions of design, 409 
simple lattices, 403 
statistical analysis, 405 
triple lattices, 403 
plans, 428 
rectangular lattices, 415 
plans, 435 
repetitions of design, 421 
statistical analysis, 417 
Lattice squares, 483 
arrangement of experimental material, 
485 
balanced lattice squares, 377, 483 
in series of experiments, 559 
index to plans, 391 
missing data, 494 
partially balanced squares, 484 
plans, 497 
randomization, 485 
statistical analysis, 485 
(k + 1)/2 or fewer replications, 485 
(k + 1) replications, 489 
2(k +1) replications, 494 
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Least significant difference, 76 
Least squares, 48 

in handling missing data, 80 

in multiple linear regression, 336 
Level of a factor, 148 
Life testing, sequential method, 31 
Limits of error, 27 
Linear function, standard error, 70 
Linear response surface. 335, 338 


Main effects, 149 
subdivision into single components, 62 
Mathematical model, 45 
analysis of variance, 53 
covariance, 83 
first order designs, 338 
incomplete block designs 
intra-block information, 380 
intra- and inter-block information, 
382 
missing data, 80 
multiple linear regression, 336 
repetitions in space and time, 565 
second order designs, 342 
series of experiments, combined analy- 
sis, 549 
split-plot design, 294 
treatment effects, estimation of, 48 
Matrix, 336 
inverse, 337 
Maximum response, 354 
methods for locating, 355 
Minimax, 364 
Missing data, 80 
balanced incomplete blocks, 450 
balanced lattice, 400 
confounded factorials, 227 
incomplete latin squares, 512 
Jatin square, 125 
lattice squares, 494 
mathematical model, 80 
partially balanced lattices, 413 
randomized blocks, 110 
split-plot design, 302 
Multiple comparisons, 75 
Multiple linear regression, 336 
Multiple range test, 76 


Newman-Keuls test, 76 
Non-central composite designs, 343 
Normal equations, 337 


Number of figures, 60 
Number of replications 


for prescribed limits of error 
continuous data, 27 
proportions or percentages, 27 
for tests of significance 
continuous data, 17 
proportions or percentages, 23 
tables, 20, 21, 24, 25, 28 


Objectives of experiment, 10 
Optimum combination of factor levels, 


354 
random selection of test points, 366 
single-factor method, 356 
single large experiment, 366 
steepest ascent method, 357 


Orthogonal, 63 


components, 551 
latin squares, 139 


Partial confounding, 185 


Partially balanced incomplete block de~ 
signs 
definition, 453 
group divisible, 456 
statistical analysis 
intra-block, 456 
with recovery of inter-block infor- 
mation, 460 
table of designs, 392. 
with two-way elimination of varia- 
tion, 518 
Permutations, random, 569 
construction, 569 
tables 
permutations of 9, 577 
permutations of 16, 583 
use, 569 
Plaid square designs 
statistical analysis and plans 
2х2 X (2) factors in an 8X8 
square, 327, 334 
2х2 X (2%) factors in an 8X8 
Square, 327, 334 
Planning of experiments, 9 
experimental material, 36 
objectives, 10 
replications, number of, 17 
sclection of treatments, 11, 34 
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Plans 
balanced incomplete blocks, 471 
index to plans, 390, 469 
balanced latin squares, 134 
completely randomized designs, 97 
confounded factorials 
2” series, 186, 219, 234 
index to plans, 215 
3" series, 193, 221, 238 
index to plans, 215 
mixed series, 203, 224, 240 
index to plans, 215, 217 
cubic lattice designs, 423 
extended incomplete latin squares, 538 
factorial experiments, 175 
fractional factorials, 276 
graeco-latin squares, 132, 146 
half-plaid squares, 332 
incomplete latin squares, 391, 520 
latin square designs, 118, 145 
with sub-units in strips, 308 
lattice designs, 391 
balanced lattices, 428 
partially balanced designs, 428, 433 
lattice square designs, 497 
index to plans, 391 
partially balanced incomplete blocks 
index to plans, 392 
plaid square designs, 334 
quasi-latin squares, 328 
rectangular lattice designs, 435 
rotatable second order designs, 370 
split-plot designs 
BCD confounded, 313 
for a Х 2° factorial, 314 
simple, 293 
sub-unit treatments in latin square, 
310 
sub-unit treatments in strips, 307 
systematic arrangement, 306 
Pooled eror mean square, series of ex- 
periments, 548, 556, 560 
Precision, 16 
Principal block, 267 
Proportions 
in completely randomized designs, 103 
in randomized blocks, 115 
Protection level, 76 


Quadratic response surface, 335, 342 
canonical form, 363 


Quadratie response surface, interpreta- 
tion, 364 
examination of fitted surface, 352 
Quasi-latin squares, 317 
efficiency, 323 
randomization, 317 
statistical analysis and plans, 318, 325 
trentments applied to complete rows 
(half-plaid), 324, 326, 332 
treatments applied to complete rows 
and columns, 327, 334 


Random selection of test points, 366 
Randomization 
effect on correlated errors, 92 
function of randomization, 6 
more than 16 numbers, 571 
restricted, 8 
tables of random numbers 
9 numbers, 577 
16 numbers, 583 
tests of randomness, 571 
Randomized blocks, 106 
advantages, 106 
efficiency, 112 
missing data, 110 
randomization, 107 
standard error, 110 
statistical analysis 
continuous data, 107 
proportions or percentages, 115 
Ranked data, in balanced incomplete 
blocks, 440 
Recovery of inter-block information, 882 
effects of errors in weights, 277 
Rectangular lattices, 415 
see also Lattice designs 
Regression 
components of treatments, 161 
multiple linear, 336 
quadratic component, 563 
with proportions or percentages, 105 
with quantitative factors, 64 
Replications, 1 
for prescribed limits of error, 27 
table, 28 
for tests of significance, 17 
tables, 20, 21 
fractional replication experiments, 244 
in latin squares, 120 у 
in lattice designs, 405 
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Replications, single replication experi- 
ments, 218 
Residual effects 
bias in estimate of error, 139 
designs for estimating, 133, 139 
statistical analysis, 135 
table, 134 
standard errors, 138, 140 
Response surface, 335 
linear, 335, 338 
quadratic, 335, 342 
Reversal designs, 141 
Ridge 
rising, 364 
stationary, 364 
Rotatable second order designs, 344 
central composite, 346 
plans, 370 


Second associates, 454 
Second order designs, 335, 342 
central composite rotatable, 346 
analysis, 347 
numerical example, 349 
table of designs, 347 
composite, 343 
in incomplete blocks, 353 
analysis, 354 
in seeking optimum combination, 361 
plans, 346, 348, 370 
rotatable, 344 
Sensitivity testing, 30 
Sequential experimentation, 29 
life testing, 31 
t-test, 30 
up and down method, 30 
use of composite designs, 343 
use of fractional replication, 265 
Series of experiments 
all experiments of same design, 548 
analysis, initial steps in, 545 
experiments of unequal size, 555 
heterogeneity of experimental error 
variances, 553 
heterogeneity of interaction variance, 
550 
lattice experiments, series of, 559 
mathematical model, 549 
numerical example, 546 
orthogonal components, 551 
repetitions in space and time, 565 


INDEX 


Series of experiments, summary, com- 
bined analysis, 555 
treatments X places interactions test, 
561 
Sign test, 116 
Simple effects, 148 
Single-factor method, 356 
compared with steepest ascent, 365, 
367 
Size of experiment, see Number of repli- 
cations 
Split-plot designs, 293 
confounded designs with plots split, 
311 
repeated subdivision, 304 
standard errors, 305 
simple split-plot design, 298 
efficiency, 296 
example, numerical, 299 
experimental error, 294 
mathematical model, 294 
missing data, 302 
randomization, 297 
standard error, 298 
statistical analysis, 297 
sub-unit treatments 
confounded, 312 
in latin squares, 309 
in strips, 306 
systematic arrangement, 305 
Staircase method, 30 
Standard error formulae 
balanced incomplete blocks, 445 
chain block designs, 466 
comparisons among treatment means, 
70 
completely randomized blocks, 100 
cross-over designs, 131 
direct and residual effects, 138 
factorials, 167 
for linear function, 70 
graeco-latin squares, 133 
half-plaid squares, 326 
incomplete blocks, 380, 382 
incomplete latin squares, 509, 511 
latin squares, 124 
lattice squares, 488, 498 
lattices 
balanced, 398 
cubic, 425 
partially balanced, 408, 411 
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Standard error formulae, rectangular, 
422 
partially balanced incomplete blocks, 
459, 462 
quasi-latin squares, 318 
randomized blocks, 110 
rotatable second order designs, 349, 
370 
split-plot designs, 298, 305 
Standard errors, 17 
pooled error, 100 
rules for calculating, 70 
with variances heterogeneous, 100 
Star, in rotatable designs, 347 
Statistical analysis, 45 
example, numerical, 55 
general method of analysis, 45 
analysis of variance, 53 
assumptions, 47 
effects of errors in assumptions, 91 
confidence limits, 50 
construction, 52 
estimation of treatment, effects, 48 
mathematical model, 45 
tests of significance, 50 
Statistical inference, 3 
Steepest ascent method, 357 
compared with other methods, 366, 
307 
correction for errors in levels, 362 
direction of steepest ascent, 359 
first experiment, 357 
interpretation of canonical form, 364 
use of second order designs, 361 
Studentized range, 74 
Subdivision 
of chi-square, 104 
of error s.s., 77 
of treatments s.s., 61 
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Sum of products, 84 
Switchback designs, 141 


t-distribution 
approximate t, 101 
errors different, 100 
sequential t-test, 30 
"Taste testing, 440 
Technique, 35 
Test of significance, 4, 50 
effects suggested by data, 73. 
number of replications for, 20 
one- and two-tailed, 18 
Transformations, 92 
"Treatments, 11 
applied in sequence, 133 
cumulative effects, 141 
direct effects, 133 
residual effects, 133 
comparison among means, 70, 79 
components, 62 
effects, estimating, 48 
linear functions, 62 
means, adjusted, 37 
multiple comparisons, 75 
selection of, 11, 34 
sum of squares 
incomplete subdivision, 65 
sample sets, 64 
single components, 62 
subdivision, 61 
test of identity of a group, 52 


Up and down method, 30 
Vectors, 336 


Youden squares, see Incomplete latin 
squares 
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